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Foreword 



rd , 



This Technical Specification has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 

The 3GPP transparent end-to-end packet-switched streaming service (PSS) specification consists of seven 3GPP TSs: 
3GPP TS 22.233 [1], 3GPPTS 26.233 [2], 3GPP TS 26.234 [3], 3GPP TS 26.244 [4], 3GPP TS 26.245 [5], 3GPP 
TS 26.246 [6], and the present document. 

The TS 22.233 contains the service requirements for the PSS. The TS 26.233 provides an overview of the PSS. The 
TS 26.234 provides the details of the protocols and codecs used by the PSS. The TS 26.244 defines the 3GPP file 
format (3GP) used by the PSS and MMS services. The TS 26.245 defines the Timed text format used by the PSS and 
MMS services. The TS 26.246 defines the 3GPP SMIL language profile. The present document defines Progressive 
Download and Dynamic Adaptive Streaming over HTTP. 

The TS 26.244, TS 26.245 and TS 26.246 start with Release 6. Earlier releases of the 3GPP file format, the Timed text 
format and the 3GPP SMIL language profile can be found in TS 26.234. 

The TS 26.247 starts with Release 10. Earlier releases of Progressive Download and Dynamic Adaptive Streaming over 
HTTP can be found in TS 26.234. 



Introduction 

Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH) collects a set of technologies how 
progressive download and adaptive streaming of continuous media may be carried out exclusively over HTTP. 
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Scope 



The present document specifies Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH). 
This specification is part of Packet-switched Streaming Service (PSS). HTTP-based progressive download and dynamic 
adaptive streaming are separated from TS 26.234 to differentiate from RTP -based streaming that is maintained in 
TS 26.234. HTTP-based progressive download and dynamic adaptive streaming may be deployed independently from 
RTP-based PSS, for example by using standard HTTP/1.1 servers for hosting data formatted as defined in the present 
document. 
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3 Definitions, abbreviations and conventions 

3.1 Definitions 

For the purposes of the present document, the terms and definitions given in TR 21.905 [7] and the following apply. A 
term defined in the present document takes precedence over the definition of the same term, if any, in TR 2 1 .905 [7] . 

access unit: unit of a media stream with an assigned Media Presentation time. 
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accessibility: the degree to which a media content or certain media content components are available to as many people 
as possible. 

Adaptation Set: a set of interchangeable encoded versions of one or several media content components. 

availableSegment: Segment which is accessible at its assigned HTTP -URL, possibly restricted by a byte range, i.e. the 
request with an HTTP GET results in a reply of the Segment and a 2xx OK status code. 

continuous media: media with an inherent notion of time. In the present document speech, audio, video, timed text and 
timed graphics. 

DASH metric: a metric identified by key and defined in this part of the specification. 

earliest presentation time: the smallest presentation time of any access unit of a Media Segment or Subsegment for a 
media stream. 

frame-packed stereoscopic 3D video: a video consisting of two views in which both views were packed into a single 
stream before compression. 

group: collection of Representations that are expected to not being presented jointly. 

HTTP-URL: a URI with a fixed scheme of 'http' or https. 

Initialization Segment: Segment containing metadata that is necessary to present the media streams encapsulated in 
Media Segments. 

media content: one media content period or a contiguous sequence of media content periods. 

media content component: one continuous component of the media content with an assigned media component type 
that can be encoded individually into a media stream. 

media content component type: a single type of media content such as audio, video, or text. 

media content period: set of media content components that have a common timeline as well as relationships on how 
they may be presented. 

Media Presentation: collection of data that establishes a bounded or unbounded presentation of media content. 

Media Presentation Description (MPD): formalized description for a Media Presentation for the purpose of providing 
a streaming service. 

Media Presentation timeline: concatenation of the timeline of all Periods which itself is common to all 
Representations in the Period. 

Media Segment: Segment that complies with media format in use and enables playback when combined with zero or 
more preceding Segments, and an Initialization Segment (if any). 

media stream: encoded version of a media content component. 

Media Subsegment: Subsegment that only contains media data but no Segment Index. 

multiview stereoscopic 3D video: a video consisting of two views packed into a single stream during compression. 

Period: interval of the Media Presentation, where a contiguous sequence of all Periods constitutes the Media 
Presentation. 

presentation time: a time associated to an access unit that maps it to the Media Presentation timeline. 

Representation: collection and encapsulation of one or more media streams in a delivery format and associated with 
descriptive metadata. 

Segment: smallest addressable unit in an MPD with a defined format. 

Segment availability end time: the time instant in wall-clock time at which a Segment ceases to be an available 
Segment. 
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Segment availability start time: the time instant in wall-clock time at which a Segment becomes an available 
Segment. 

Segment Index: a compact index of the time range to byte range mapping within a Media Segment separately from the 
MPD. 

stereoscopic 3D video: a video bitstream consisting of two views. 

stream access point (SAP): position in a Representation enabling playback of a media stream to be started using only 
the information contained in Representation data starting from that position onwards (preceded by initializing data in 
the Initialization Segment, if any). 

Sub-Representation: part of a Representation described in the MPD that is present in the entire Period. 

Subsegment: smallest unit within Media Segments that is indexed by a Segment Index. 

valid Segment URL: an HTTP-URL that is promised to reference a Segment during its Segment availability period. 

wall-clock time: time as stated by UTC (Universal Co-ordinated Time). 



3.2 



Abbreviations 



For the purposes of the present document, the abbreviations given in TR 2L905 [7] and the following apply. 

An abbreviation defined in the present document takes precedence over the definition of the same abbreviation, if any, 

in TR 21.905 [7]. 

3GP 3GPP file format 

3GP-DASH 3GPP Dynamic Adaptive Streaming over HTTP 

AHS Adaptive HTTP Streaming 

AVC Advanced Video Coding 

DM Device Management 

DRM Digital Rights Management 

HSD HTTP Streaming and Download 

HTML Hypertext Markup Language 

HTTP Hypertext Transfer Protocol 

HTTPS Hypertext Transfer Protocol Secure 

IDR Instantaneous Decoding Refresh 

MPD Media Presentation Description 

MPEG-2 TS Moving Picture Experts Group Transport Stream 

MIME Multipurpose Internet Mail Extensions 

OMA Open Mobile Alliance 

PDCF Packetized DRM Content Format 

PSS Packet-switched Streaming Service 

QoE Quality-of-Experience 

RFC Request For Comments 

RTP Real-time Transport Protocol 

SAP Stream Access Point 

SMIL Synchronised Multimedia Integration Language 

TLS Transport Layer Security 

URI Uniform Resource Identifier 

URL Uniform Resource Locator 

URN Uniform Resource Name 

UTC Universal Time Coordinated 

UTF-8 Unicode Transformation Format (the 8-bit form) 

UUID Universally Unique Identifier 

W3C WWW Consortium 

XML extensible Markup Language 

XSLT extensible Stylesheet Language Transformation 
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3.3 Conventions 

The following naming conventions apply in this specification: 

Elements in an XML-document are identified by an upper-case first letter and in bold face as Element. 
To express that an element Elementl is contained in another element Element2, we may write 
Element2 . Elementl. If an element is constructed of two or more combined words, camel-casing is typically 
used, e.g. ImportantElement. Elements are present exactly once, or the minimum and maximum occurence 
is defined by <minOccurs> ... <maxOccurs>. 

Attributes in an XML-document are identified by a lower-case first letter as well as they are preceded by a "@"- 
sign, e.g. ©attribute. To point to a specific attribute ©attribute contained in an element Element, we 
may write Element@attribute . If an attribute is constructed of two or more combined words, 
camel-casing is typically used after the first word, e.g. ©verylmportantAttribute. Attributes are 
assigned a status in the XML as mandatory (M), optional (O), optional with default value (OD) and conditionally 
mandatory (CM). 

Namespace qualification of elements and attributes is used as per XML standards, in the form of 
namespace : Element or ©namespace : attribute The fully qualified namespace will be provided in 
the schema fragment associated with the declaration. 

Variables defined in the context of the present document are specifically highlighted with italics, e.g. 
IntemalVariable. 

Structures that are defined as part of the hierarchical data model are identified by an upper-case first letter, e.g. 
Media Presentation, Period, Group, Adaptation Set, Representation, Segment, etc. 



Overview 



The present document specifies Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH) for 
continuous media. The features are separated from the umbrella specification TS 26.234 [3] to differentiate from RTP- 
based streaming that is specified and maintained in TS 26.234. Services relying exclusively on these features may be 
deployed independently from RTP-based PSS servers, for example by using standard HTTP/1.1 servers for hosting the 

services. 

The specification covers the following aspects: 

System Description: describes the relationship to the PSS architecture and refines the architecture, interfaces and 
protocols that are defined in this specification. 

Progressive Download over HTTP. 

3GPP Dynamic Adaptive Streaming over HTTP (3G-DASH) provides an overview of the architecture, the 
formats and the models that build the basis for 3GP-DASH. Also, 3GP-DASH Profiles provides an identifier and 
refers to a set of specific restrictions in this or other specifications. 

DASH - Media Presentation describes the data model of a Media Presentation. It also provides an overview on 
elements and attributes that may be used to describe components and properties of a media presentation in a 
Media Presentation Description (MPD). 

DASH - Usage of the 3GP file format defines how segments can be formed based on the 3GP file format. 

Quality-of-Experience for Progressive Download and 3GP-DASH. 

Normative annexes for MPD schema (Annex B), Descriptor Scheme Definitions (Annex C), OMA DM QoE 
Management Object (Annex F), File format extensions for 3GPP DASH support (Annex G) and MIME Type 
Registration for MPD (Annex H). 

Informative annexes for Client Behaviour (Annex A), MPD Examples (Annex D), and Mapping MPD structure 
and semantics to SMIL (Annex E). 
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System Description 



5.1 



Overview 



Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH) enables to provide services to 
deliver continuous media content over Hypertext Transfer Protocol (HTTP) in a sense that all resources that compose 
the service are accessible through HTTP-URLs and the HTTP/1.1 protocol as specified in RFC 2616 [9] may be used to 
deliver the metadata and media data composing the service. This enables that standard HTTP servers and standard 
HTTP caches can be used for hosting and distributing continuous media content. Figure 1 shows the architecture for 
services using progressive download and Figure 2 shows the architecture for services using 3GP-DASH. 

The present document deals with the specification of interfaces between the Client and the Server. Specifically, it 
defines the formats that may be delivered exclusively over the HTTP interface to enable progressive download and 
streaming 
services. 
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Figure 1 : Architecture for Progressive Download over HTTP 

Services using the features described in this specification may be deployed within PSS as specified in TS 26.233 [2] and 
TS 26.234 [3]. In this case the Progressive Download/3GP-DASH Server may be a sub-function of the PSS server and 
the Progressive Download/3GP-DASH client may be a sub-function of the PSS client. 
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Figure 2: Architecture for 3GP-DASH 

Services using the features defined in this specification may also be deployed independent of the PSS servers and 
clients. In this case the Progressive Download/3GP-DASH client shall support the formats and codecs according to this 
specification. 

Access to services based on the features defined in the present document is introduced in clause 5.2. 

The protocol support for services using the features defined in this specification is provided in clause 5.3. 

Clients supporting progressive download-based services shall support the features and formats as specified in clause 6 
of this specification. 

Clients supporting 3GP-D ASH shall support the features and formats as specified in clause 7 of this specification. 

CUents supporting QoE Metrics and Reporting shall support the features as specified in clause 10 of this specification. 
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5.2 



Service Access 



Service access refers to the method by which a Client initially accesses the service. Service access for services based in 
the specification can be achieved e.g. by a Media Presentation Description or a URL to the media file. 

The service access URL can be made available to a client in many different ways. Clients supporting services based on 
the features in this specification shall be able to access services that are provided through an HTTP-URL. However, it is 
out of the scope of this specification to mandate any specific mechanism. A preferred way may be to embed URLs for 
service establishment within HTML pages. 



5.3 



Protocols 



Progressive Download and 3GP-DASH clients shall comply with a client as specified in RFC 2616 [9]. The resource 
hosting the 3GP files and DASH Segments shall comply with a server as specified in RFC 2616 [9]. 

Progressive Download and 3GP-DASH clients should use the HTTP GET method or the HTTP partial GET method, as 
specified in RFC 2616 [9], clause 9.3, to access media offered at HTTP-URLs. 

Figure 3 shows a protocol stack for services in the context of this specification. 3GP Files in progressive download as 
well as Segments based on the 3GPP File Format shall be accessible through HTTP. 
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Figure 3: Overview of thie protocols stack 

Transport security in Progressive Download and Dynamic Adaptive Streaming over HTTP (3GP-DASH) is achieved 
using the HTTPS (Hypertext Transfer Protocol Secure) specified in RFC 2818 [12] and TLS as specified in TLS profile 
of Annex E in TS 33.310 [23]. In case secure delivery is desired, HTTPS should be used to authenticate the server and 
to ensure secure transport of the content from server to client. 

NOTE 1 : The use of HTTPS for delivering Media Segments may inhibit caching at proxies and add overhead at 
the server and the client. 

NOTE 2: In the case of MBMS download delivery of 3GP-DASH content, one way of supporting the delivery of a 
subset of the nominally requested content by the DASH client which indicates explicit willingness to 
accept such incomplete content, and based on a specific UE implementation architecture, is described in 
clause 7.2. A in TR 26.946 [36]. 
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Progressive Download over HTTP 



6.1 General 

As an alternative to conventional streaming, a client may download, typically through HTTP, a media file that 
encapsulates continuous media and may play the media from the local storage. A PSS client shall support progressive 
download and playout of 3GP files [4] as specified in the remainder of this clause. 

The media file encapsulating the continuous media is accessed directly by issuing one or more HTTP GET or partial 
GET requests to the referenced media file. An example of a valid URL is http://example.com/morning_news.3gp. 



6.2 Progressive Download 



Progressive download uses normal HTTP download using HTTP GET or partial GET requests. The differences between 
regular download and Progressive Download are that 1) the content may be authored as progressively downloadable, 
and 2) the terminal recognises that the content is suitable for progressive download. 

A client downloading continuous media may decide to start playout of the encapsulated media data before the download 
of the media file is completed. 

6.3 3GPP File Format Profiles 

The following profiles of the 3GPP file format in TS 26.244 [4] shall be supported by clients supporting Progressive 
Download over HTTP: 

Basic profile, and 

- Progressive-download profile. 



3GPP Dynamic Adaptive Streaming over HTTP 



7.1 System Description 



The 3GPP Dynamic Adaptive Streaming over HTTP (3GP-DASH) specified in this specification provides streaming 
services over HTTP. For this it specifies XML and binary formats that enable delivering content from standard HTTP 
servers to an HTTP-Streaming client and enables caching content by standard HTTP caches. 

The specification for 3GP-DASH primarily defines two formats: 

1) The Media Presentation Description (MPD) describes a Media Presentation, i.e. a bounded or unbounded 

presentation of media content. In particular, it defines formats to announce resource identifiers for Segments and 
to provide the context for these identified resources within a Media Presentation. For 3GP-DASH, the resource 
identifiers are exclusively HTTP-URLs possibly combined with a byte range. 

2) The Segment formats specify the formats of the entity body of the HTTP response to an HTTP GET request or an 

HTTP partial GET request with the indicated byte range through HTTP/Ll as defined in RFC 2616 [9] to a 
resource identified in the MPD. Segments typically contain efficiently coded media data and metadata according 
to or aligned with common media formats.. 

The MPD provides sufficient information for a client to provide a streaming service to the user by accessing the 
Segments through the protocol specified in the scheme of the defined resources, in the context of this specification 
exclusively HTTP/LL Such a client is referred to as a 3GP-DASH client in the remainder of the present document. 
However, this specification does not provide a normative definition for such a client. An informative client model to 
illustrate the formats defined in this specification is provided in section 7.2. An informative example client behaviour 
description is provided in Annex A of this specification. 
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Figure 7-1 shows an architecture in which the formats defined in this specification are typically used. Boxes with solid 
lines indicate devices that are mentioned in this specification as they host or process the formats defined in this 
specification whereas dashed boxes are conceptual or transparent. This specification deals with the definition of formats 
that are accessible on the interface to the 3GP-DASH client, indicated by the solid lines. Any other formats or interfaces 
are not in scope of this specification. In the considered deployment scenario, it is assumed that the 3GP-D ASH client 
has access to an MPD. The MFD provides sufficient information for the 3GP-DASH cUent to provide a streaming 
service to the user by requesting Segments from an HTTP server and demultiplexing, decoding and rendering the 
included media streams. 
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Figure 7-1 : System Architecture for 3GP-DASH 

The normative aspects of 3GP-DASH formats are defined by 

the profiles defined in clause 7.3. 

the DASH Media Presentation as defined in clause 8. 

the usage of the 3GPP file format for DASH as defined in clause 9. 

The clauses mentioned above may refer to normative aspects in clause 10 on Quality-of-Experience as well as to 
normative Annexes B, C, E, G, and H. 



7.2 



3GP-DASH Client Model 



The design of the formats defined in this specification is based on the informative client model as shown in Figure 7-2. 
The figure illustrates the logical components of a conceptual 3GP-DASH client model. In this figure the 3GP-DASH 
Access Engine receives the Media Presentation Description (MPD), constructs and issues requests and receives 
Segments or parts of Segments. In the context of this standard, the output of the DASH Access Engine consists of 
media in container formats according to the ISO/lEC 14496-12 ISO Base Media File Format [11] and specifically the 
3GP file format [4]. In addition, timing information is provided that maps the internal timing of the media to the time 
line of the Media Presentation. 
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Figure 7-2: 3GP-DASH client Model 



7.3 3GP-DASH Profiles 
7.3.1 General 

Profiles of 3GP-DASH are defined so as to enable interoperability and the signaling of the use of features etc. A profile 
refers to a set of specific restrictions. Those restrictions might be on features of the MPD as defined in clause 8 of this 
specification, Segment formats as for example defined in clause 9 of this specification, usage of the network, codec(s) 
used, content protection formats, or on quantitative measures such as bit-rates, segment lengths, screen size, and so on. 
Profiles defined in this specification define restrictions on features of this specification, but may additionally impose 
restrictions on other aspects of media delivery. 

NOTE A profile can also be understood as permission for 3GP-DASH clients that only implement the features 

required by the profile to process the Media Presentation. However, as 3GP-DASH client operation is not 
specified normatively, it is also unspecified how a 3GP-DASH client conforms to a particular profile. 
Hence, profiles merely specify restrictions on MPD and Segments rather than DASH client behaviour. 

A profile has an identifier, which is a URL The profiles with which a Media Presentation complies are indicated in the 
MPD@prof iles attribute. This element is a comma-separated list of profile identifiers. Profile identifiers defined in 
this specification are URNs conforming to RFC 3406 [21]. URLs may also be used. When a URL is used, it should also 
contain a month-date in the form mmyyyy; the assignment of the URL must have been authorized by the owner of the 
domain name in that URL on or very close to that date, to avoid problems when domain names change ownership. 

An MPD is conforming when it satisfies the following: 

1. The MPD is valid in terms the schema defined in Annex B. 

2. The MPD conforms to the normative requirements defined in this specification. 

3. The MPD conforms to each of the profiles indicated in the MPD@prof iles attribute as specified below. 

When ProfA is included in the MPD@prof iles attribute, the MPD is modified into a profile-specific MPD for profile 
conformance checking using the following ordered steps: 

1. The MPD@prof iles attribute of the profile-specific MPD contains only ProfA. 

2. An AdaptatlonSet element for which ©profiles does not or is not inferred to include ProfA is 
removed from the profile-specific MPD. 

3. A Representation element for which ©profiles does not or is not inferred to include ProfA is 
removed from the profile-specific MPD. 

4. All elements or attributes that are either (i) in this specification and explicitly excluded by ProfA, or (ii) in 
an extension namespace and not expUcitly included by ProfA, are removed from the profile-specific MPD. 

5. All elements and attributes that 'may be ignored' according to the specification of ProfA are removed from 
the profile-specific MPD, 

An MPD is conforming to profile ProfA when it satisfies the following: 
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1. ProfA is included in the MPD@prof iles attribute. 

2. The profile-specific MPD for ProfA is valid in terms the schema defined in Annex B. 

3. The profile-specific MPD for ProfA conforms to the normative semantics defined in this specification. 

4. The profile-specific MPD for ProfA conforms to the restrictions specified for ProfA. 
A Media Presentation is conforming to profile /5rq/^ when it satisfies the following: 

1. The MPD of the Media Presentation is conforming to profile ProfA as specified above. 

2. There is at least one Representation in each Period in the profile-specific MPD for ProfA. 

3. The Segments of the Representations of the profile-specific MPD for ProfA conform to the restrictions 
specified for ProfA. 

NOTE In other words, each MPD contains at least one Representation in each Period, which fulfils the 

requirements of a profile listed in MPD@prof iles. There may be stricter rules on the occurrence of 
Representations in the specified profiles. For example, it can be required that there is at least one 
Representation for each media type that contains or is inferred to have the profile identifier of a specific 
profile. 

7.3.2 3GPP Adaptive HTTP Streaming (Release-9 AHS) 

Release-9 Adaptive HTTP Streaming as defined in TS 26.234 [3] Release-9, clause 12 is not a profile of this 

specification. Rel-9 AHS uses a different namespace 

"urn : 3GPP : ns : PSS : Adapt iveHTTPStreamingMPD : 2009" and a different MIME type signalling 

"application/3gpp-ahs+xml" for the MPD. However, a Media Presentation may be defined such that segments 

complying with the segment formats in TS 26.234 [3] Release-9, clause 12, also comply with segment formats for this 

specification. 

7.3.3 3GP-DASH Release-1 Profile 

7.3.3.1 Introduction 

The 3GP-DASH Release-10 profile is identified by the URN 'urn : 3GPP : PSS : profile : DASHIO'. 

This includes all features defined in the Release-10 version of this specification in clauses 7.3.3.2, 7.3.3.3, 8, 9 and 10. 

The ©mimeType attribute of each Representation shall be provided according to RFC4337. Additional parameters may 
be added according to RFC6381 [26]. 

7.3.3.2 Media Codecs 

For the 3GP-DASH Release-10 profile clients supporting a particular continuous media type, the corresponding media 
decoders are specified in TS 26.234 [3], clause 7.2 for speech, 7.3 for audio, 7.4 for video, 7.9 for timed text and 7.1 1 
for timed graphics. 

7.3.3.3 Content Protection 

For the 3GP-DASH Release-10 profile clients content protection may support OMA DRM 2.0 [15] or OMA DRM 2.1 
[16]. Other content protection schemes may be supported. The ContentProtection element in the MPD should be used to 
convey content protection information. 

When using OMA DRM V2.0 or OMA DRM V2.1 scheme for content protection, the non-streamable Packetized DRM 
Content Format (PDCF) shall be used. An OMA-DRM encrypted Representation shall include the brand 'opf2'. OMA- 
DRM [15] [16] defines the procedures for acquiring the Rights Object from the Rights Issuer to decrypt PDCF 
protected content. The scheme is identified by a ContentProteGtion@schemeIdUri set to 
"urn :mpeg: dash :mp4protect ion" and the ContentProtection@value shall include the version 
number; it starts with "odkm", which is the scheme_type contained in the Scheme Type Box of the PDCF file, followed 
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by a ":" and the scheme_version from the Scheme Type Box of the PDCF file, encoded as up to 8 hexadecimal digits, 
where the leading "0"s may be omitted. For example, for OMA DRM2.0 the value could be "odkm:200". 

7.3.4 3GP-DASH Release 1 1 multiview stereoscopic 3D video profile 

The 3GP-DASH Release 1 1 multiview stereoscopic 3D video profile is identified by the URN 

'urn :3GPP:PSS: prof ile:DASHll:MS3D'. 

The ©mimeType attribute of each Representation shall be provided according to RFC4337. Additional parameters may 
be added according to RFC6381 [26]. 

This profile includes all features defined in clauses 7.3.3.3, 8, 9 and 10. 

3GP-DASH Release 1 1 multiview stereoscopic 3D video profile clients shall support multiview stereoscopic 3D video 
as specified in clause 7.4 of TS 26.234 [3]. For any other particular continuous media type, the corresponding media 
decoders are specified in TS 26.234 [3], clause 7.2 for speech, 7.3 for audio, 7.4 for video, 7.9 for timed text and 7.1 1 
for timed graphics. Additionally, the following contraints apply for multiview stereoscopic 3D video bitstreams, if 
present in a media presentation: 

The DASH multiple views scheme as defined in 5.8.5.6 of ISO/IEC 23009-1 [34] shall be used in the MPD. 

One of the following shall be true: 

• The base view of the stereoscopic multiview bitstream shall be a complementary representation and the 
non-base view of the bitstream shall be a dependent representation. The @dependencyld attribute as 
specified in 5.3.5.2 of ISO/IEC 23009-1 [34] shall be used to indicate the complementary and 
dependent representations. 

• The base view and the non-base view of the stereoscopic multiview bitstream shall reside in the same 
representation. The SubRepresentation element shall be used for the representation, and the base view 
and the non-base view shall form separate sub -representations. The @level and @dependencyLevel 
attributes within the SubRepresentation element shall be used. The Level Assignment box shall be used. 
For each leaf segment index, that is, each Segment Index box that indexes only subsegments but not 
other Segment index boxes, there shall be exactly one Subsegment Index box. 



7.3.5 3GP-DASH Release 1 1 frame-packed stereoscopic 3D video profile 

The 3GP-DASH Release 1 1 frame-packed stereoscopic 3D video profile is identified by the URN 

'urn : 3GPP : PSS : profile : DASHll : FPS3D'. 

The ©mimeType attribute of each Representation shall be provided according to RFC4337. Additional parameters may 
be added according to RFC6381 [26]. 

This profile includes all features defined in clauses 7.3.3.3, 8, 9 and 10. 

3GP-DASH Release 1 1 frame-packed stereoscopic 3D video profile clients shall support frame-packed stereoscopic 3D 
video as specified in clause 7.4 of TS 26.234 [3]. For any other particular continuous media type, the corresponding 
media decoders are specified in TS 26.234 [3], clause 7.2 for speech, 7.3 for audio, 7.4 for video, 7.9 for timed text and 
7. 11 for timed graphics. Additionally, the following contraints apply for frame-packed stereoscopic 3D video 
bitstreams, if present in a media presentation: 

The FramePacking element as defined in clause 8.4.3.2 shall be used in the MPD. 
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8 DASH - Media Presentation 

8.1 Introduction 

A Media Presentation is a structured collection of data that is accessible to a 3GP-DASH client to provide a streaming 
service to the user. 

3GP-DASH is intended to support a media-streaming model for delivery of media content in which control of the 
delivery lies exclusively with the client. Clients may request data using the HTTP protocol from standard web servers 
that have no 3GP-DASH-specific capabilities. Consequently, this standard focuses not on client or server procedures 
but on the data formats used to provide a DASH Media Presentation. 

The collection of encoded and deliverable versions of media content and the appropriate description of these form a 
Media Presentation. Media content is composed of a single or multiple contiguous media content periods in time. Each 
media content period is composed of one or multiple media content components, for example audio components in 
various languages and a video component. Each media content component has an assigned media content component 
type, for example audio or video. 

Each media content component may have several encoded versions, referred to as media streams. Each media stream 
inherits the properties of the media content, the media content period, the media content component from which it was 
encoded and in addition it gets assigned the properties of the encoding process such as sub-sampling, codec parameters, 
encoding bitrate, etc. This describing metadata is relevant for static and dynamic selection of media content components 
and media streams. 
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Figure 8.1 : 3GP-DASH High-Level Data Model 

DASH is based on a hierarchical data model aligned with the presentation in Figure 8.1. A DASH Media Presentation is 
described by a Media Presentation Description (see clause 8.4.1) document. This describes the sequence of 
Periods (see clause 8.4.2) in time that make up the Media Presentation. A Period typically represents a media content 
period during which a consistent set of encoded versions of the media content is available i.e. the set of available 
bitrates, languages, captions, subtitles etc. does not change during a Period. 

Within a Period, material is arranged into Adaptation Sets (see clause 8.4.3.3). An Adaptation Set represents a set of 
interchangeable encoded versions of one or several media content components. For example there may be one 
Adaptation Set for the main video component and a separate one for the main audio component. If there is other 
material available, for example captions or audio descriptions, then these may each have a separate Adaptation Set. 
Material may also be provided in multiplexed form, in which case interchangeable versions of the multiplex may be 
described as a single Adaptation Set, for example an Adaptation Set containing both the main audio and main video for 
a Period. Each of the multiplexed components may be described individually by a media content component 
description. 

An Adaptation Set contains a set of Representations (see clause 8.4.3.4). A Representation describes a deliverable 
encoded version of one or several media content components. A Representation includes one or more media streams 
(one for each media content component in the multiplex). Any single Representation within an Adaptation Set is 
sufficient to render the contained media content components. Typically, clients may switch from Representation to 
Representation within an Adaptation Set in order to adapt to network conditions or other factors. Chents may also 
ignore Representations that rely on codecs or other rendering technologies they do not support or that are otherwise 
unsuitable. 
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Within a Representation, the content may be divided in time into Segments (see clause 8.4.4 and clause 9). A URL is 
provided for each Segment meaning that a Segment is the largest unit of data that can be retrieved with a single HTTP 
request. 

DASH defines different timelines. One of the key features in DASH is that encoded versions of different media content 
components share a common timeline. The presentation time of access unit within the media content is mapped to the 
global common presentation timeline for synchronization of different media components and to enable seamless 
switching of different coded versions of the same media components. This timeline is referred as Media Presentation 
timeline. The Media Segments themselves contain accurate Media Presentation timing information enabling 
synchronization of components and seamless switching. 

A second timeline is used to signal to clients the availability time of Segments at the specified HTTP-URLs. These 
times are referred to as Segment availability timesand are provided in wall-clock time. Clients typically compare the 
wall-clock time to Segment availability times before accessing the Segments at the specified HTTP-URLs. For On- 
Demand services with a static MPD, the availability times of all Segments are identical. For live services when the 
MPD is updated, the availability times of Segments depend on the position of the Segment in the Media Presentation 
timeline. 

Segments are assigned a duration, which is the duration of the media contained in the Segment when presented at 
normal speed. Typically all Segments in a Representation have the same or roughly similar duration. However Segment 
duration may differ from Representation to Representation. A DASH presentation can be constructed with relative short 
Segments (for example a few seconds), or longer Segments including a single Segment for the whole Representation. 

Short Segments are usually required in the case of live content, where there are restrictions on end-to-end latency. The 
duration of a Segment is typically a lower bound on the end-to-end latency. DASH does not support the possibility for 
Segments to be extended over time: a Segment is a complete and discrete unit that must be made available in its 
entirety. 

Segments may be further subdivided into Subsegments each of which contains a whole number of complete access 
units. In formats defined in this specification, a Subsegment contains a whole number of complete movie fragments. A 
Segment may be divided into Subsegments described by a compact Segment index, which provides the presentation 
time range in the Representation and corresponding byte range in the Segment occupied by each Subsegment. Clients 
may download this index in advance and then issue requests for individual Subsegments. 

Clients may switch from Representation to Representation within an Adaptation Set at any time in the media content. 
However, switching at arbitrary positions may be complicated because of coding dependencies within Representations 
and other factors. It is also desirable to avoid download of 'overlapping' data i.e. media for the same time period from 
multiple Representations. Usually, switching is simplest at a random access point in the new stream. In order to 
formalize requirements related to switching DASH defines a codec-independent concept of Stream Access Point and 
identifies various types of Stream Access Point. 

Segmentation and Subsegmentation may be performed in ways that make switching simpler. For example, in the very 
simplest cases each Segment or Subsegment begins with a random access point and the boundaries of Segments or 
Subsegments are aligned across the Representations of an Adaptation Set. In this case, switching Representation 
involves playing to the end of a (Sub)Segment of one Representation and then playing from the beginning of the next 
(Sub)Segment of the new Representation. The Media Presentation Description and Segment Index provide various 
indications, which describe properties of the Representations that may make switching simpler. 

For On-Demand services, the Media Presentation Description is a static document describing the various aspects of the 
Media Presentation. All Segments of the Media Presentation are available on the server once any Segment is available. 
For live services, however. Segments become available with time as the content is produced. The Media Presentation 
Description may be updated regularly to reflect changes in the presentation over time, for example Segment URLs for 
new Segments may be added to the MPD and those for old, no longer available Segments may be removed. However, if 
Segment URLs are described using a template, this updating may not be necessary except for some redundancy/failover 
cases. 
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In summary a Media Presentation is described in a Media Presentation Description (MPD) including any possible 
updates of the MPD. The MPD is defined in clause 8.2 and the update mechanisms in 8.5. Assembly of a fragmented 
MPD is defined in 8.3. The data model that constitutes a Media Presentation is defined in 8.4 and some additional 
elements in the MPD that describe the content are provided in 8.6. 

8.2 Media Presentation Description 

8.2.1 General 

The Media Presentation Description (MPD) is a document that contains metadata required by a 3GP-DASH client to 
construct appropriate HTTP-URLs to access Segments and to provide the streaming service to the user. 

NOTE: actual playback of the media streams included in the Representations is not controlled by the MPD 

information. Playback is controlled by the media engine operating on the media streams contained in the 
Representations in the usual way. 

The format of URLs in the MPD and the process to generate HTTP GET and partial GET requests from URLs provided 
in the MPD is defined in 8.7. 

The MPD is an XML-document that is formatted according to the XML schema provided in clause 8.2.2. 

The MIME type of the MPD shall be 'application/dash+xml' as defined in Annex H.L 

MPDs may be updated as specified in clause 8.5. Updates may also be done using MPD delta files as defined in clause 
8.5.2. The MIME type of an MPD delta file shall be 'application/dashdelta+xml' as defined in Annex H.2. 

The delivery of the MPD is not in scope of this specification. If the MPD is delivered over HTTP, then the MPD may be 
transfer encoded for transport, as described in [18] using the generic GZip algorithm RFC 1952 [18]. 3GP-DASH 
cUents shall support GZip content decoding of the MPD when delivered over HTTP (GZIP RFC 1952 [18], clause 9). 

8.2.2 Schema and 3GPP Extension 

The overview of the XML schema of the MPD is provided in below. Specific types, elements and attributes are 
introduced in the remainder of this clause. The complete MPD schema is provided in Annex B of this specification. In 
case of any inconsistencies the schema in Annex B takes precedence over the XML-syntax snippets provided in this 
clause. For the normative schema refer to the schema in Annex B. 

The main schema is provided in Table 8-1 with the namespace "urn : mpeg : DASH : schema :MPD : 2011". The 
3GPP extension namespace is provided in Table 8-2 with namespace "urn: 3GPP :ns : DASH: MPD -ext : 2 011". An 
extension schema for 3GPP in the context of the specification is referred to as "3gpp-2011.xsd". Elements and attributes 
in the extension namespace are preceded with "x3gpp : " throughout this document. 

The MPD shall be authored such that, after unrecognized XML attributes or elements are removed, the result is a valid 
XML document formatted according to the XML schema provided in Annex B and that complies with this 
specification. Namespaces may be used to extend functionalities. Therefore, all extended elements and attributes added 
to a Representation in particular shall be such that they can be safely ignored by 3GP-DASH clients. 

Example for vaUd MPDs are provided in Annex D. 

Table 8-1 : Overview of XML schema of the MPD 

<?xml version="l . 0"?> 

<xs : schema targetNamespace= "urn: mpeg: DASH: schema :MPD : 2011" 

attributeFormDef ault : "unqualified" eiementFormDef aul "qualified" 

xmlns :xs=" http://www.w3 . org/2 l/XMLSchema" 

xmlns :xlink="http : //www.w3 . org/1999/xlink" 

xmlns :x3gpp="urn: 3GPP:ns : DASH: MPD- ext :2 011" 

xmlns = "urn : mpeg : DASH : schema : MPD : 2 11 " > 

<xs : annotation> 

<xs : appinfoMedia Presentation Description</xs : appinf o> 

</xs : annotation> 

<xs : import namespace; "http://www.w3.org/1999/xlink" 3chemaLocation="xlink.xsd"/> 

<xs : import namespace="urn : 3GPP:ns : DASH: MPD -ext :2011" schemaLocation="3gpp-2 011 .xsd"/> 
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<!-- MPD : main 


element - 


- > 




<xs : element 


"MPD" 




"MPDtype"/> 


</xs : schema> 









Table 8-2: Overview of XML schema for 3GPP MPD extensions 



<?xml version="l . 0" ?> 

<xs : schema targetNamespace="urn: 3GPP:ns : DASH: MPD -ext :2011" 

attributeFormDef ault- "unqualified" elementFormDefaul : "qualified" 

xmlns :xs="http; //www.w3 . org/2 1/XMLSchema" 

xmlns = "urn : 3GPP : ns : DASH : MPD- ext : 2 11 " > 

<xs : annotation> 

<xs : appinfoExtensions to Media Presentation Description for 3GPP</xs : appinf o> 

</xs : annotation> 



</xs : schema> 



8.2.3 (void) 

8.2.4 (void) 

8.3 MPD Assembly 



8.3.1 



Introduction 



This clause defines a mechanism for referencing a remote DASH element from within a local MPD. A subset of W3C 
XLINK [20] simple links is defined consisting of: 

restricted syntax and semantics in clause 8.3.2, and 

the processing model in clause 8.3.3. 

8.3.2 Syntax and semantics 

Table 8-3 provides the XLINK attributes that are used in this specification and shall be supported accordingly. 

Table 8-3: XLINK attributes used in this specification 



Attribute 


Comments and Usage 


©xlink: type 


Identifies tlie type of W3C XLINK being used. 

In the context of specification, all references shall be W3C XLINK simple linl<s. As the 

attribute @xlink:type is optional with fixed setting @xlink:type=" simple". 


©xlink : href 


Identifies the remote DASH Element by URI as defines in IETF RFC 3986 [17]. 
In the context of this specification, URI shall exclusively be HTTP-URLs. 


©xlink : show 


Defines the desired behaviour of a remote DASH element once dereferenced from within 

a MPD as defined in W3C XLINK. 

In the context of this specification the attribute oxlink : show is optional with fixed 

setting @xl ink :show=" embed". 

NOTE: In W3C XLINK, the behaviour of conforming XLink applications when 

embedding XML-based ending resources, such as a remote DASH element, is 
not defined. Thus, the actual behaviour for this standard is defined in clause 
8.3.3. 
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Attribute 



Comments and Usage 



©xlink : actuate 



Defines the desired timing of dereferencing a remote DASH-Element from within a IVIPD 
as defined in W3C XLINK. The following attribute values are allowed in this standard: 

1) onLoad: an application should dereference the remote DASH element immediately 
on loading the MPD. 

2) onRequest (default): formally, an application should dereference the remote DASH- 
element only on a post-loading event triggered for the purpose of dereferencing. In the 
context of this specification, the application dereferences the link only for those resources 
it needs (or anticipates it probably will need). Examples include de-referencing a link in a 
Period element when the play-time is expected to enter that period, de-referencing a 
representation group link when it appears to contain representations that will be needed, 
and so on. 



The restricted schema for XLINK in the context of the standard is referred to as "xlink.xsd" in any schema in this 
standard and defined in Table 8-4. 

Table 8-4: XML Schema for XLINK attributes used in this specification 



<?xml version= ' 1 . ' ?> 

<xs : schema xmlns :xs="http : //www. w3 . org/2 001/XMLSchema" 

targetNamespace="http : //www.w3 .org/1999/xlink" 

xmlns :xlink="http : //www.w3 .org/1999/xlink" > 

<xs : attribute name="type" type="xs : token" f ixed-"simple"/> 

<xs : attribute name="href" type="xlink:hrefType"/> 

<xs : simpleType "href Type" > 

<xs : restriction "xs : anyURI"/> 
</xs : simpleType> 

<xs : attribute name="show" type="xs : token" f ixed-"embed"/> 

<xs : attribute name="actuate" type="xlink: actuateType" defauli "onRequest"/> 

<xs : simpleType "actuateType" > 
<xs : restriction base="xs : token" > 

<xs : enumeration value "onLoad"/> 
<xs : enumeration value= "onRequest "/> 
</xs : restriction> 
</xs : simpleType> 
</xs : schema> 



8.3.3 Processing 

The following rules apply to the processing of URI references within @xlink : href: 

1) URI references to remote elements that cannot be resolved shall be treated as invalid references and invalidate 
the MPD. 

2) URI references to remote elements that are inappropriate targets for the given reference shall be treated as 
invalid references (see list below for the appropriate targets) references and invalidate the MPD. 

3) URI references that directly or indirectly reference themselves are treated as invalid circular references 
references and invalidate the MPD. 

4) Any URI reference to a remote element shall be an HTTP-URL. 

5) If a URI reference is relative then reference resolution as defined in 8.7.4 shall apply. 

The remote elements referenced from within an MPD (referred to as appropriate targets) shall be embedded into the 
MPD by applying the following rules: 

1) Attributes and elements obtained from the remote element shall be added to the element of the MPD that 
contains @xlink : href and shall be merged with the ones already present in the MPD. If the same attributes 
are present in both MPD and remote element, the attribute values should be the same. If they are not identical. 
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then the value of the attribute of the MPD takes precedence over the value of the attribute in the remote DASH 
element. 

2) The remote DASH element referenced by the ©xlink : href shall conform to the type definition of the 
element in the MPD that contains ©xlink : href. 

3) All XLINK attributes shall be removed after dereferencing is completed. 

4) Only a single element shall be included in a remote DASH element. 

5) All resources in the remote element referenced by @xl ink : href shall have an availability end time as 
specified by MPD@ava i 1 ab i 1 i tyEnciT ime . 



8.4 



Hierarchical Data IVIodel 



8.4.1 General 

A Media Presentation is described in the MPD element that is contained in an MPD document formatted as defined in 

clause 8.2. 

A Media Presentation consists of: 

A sequence of one or more Periods described in 8.4.2. 

Each Period contains one or more Adaptation Sets that itself contains one or more Representations as described 
in clause 8.4.3. Clause 8.4.3 also defines media content components and Sub-Representations. 

Each Representation consists of one or more Segments. Segment Information is introduced in clause 8.4.4. 
Segments contain media data and/or metadata to access, decode and present the included media content. 

The summary of the semantics of the attributes and elements within an MPD element are provided in Table 8-5. The 
XML-syntax of the MPD element is provided in Table 8-6. 

Table 8-5: Semantics of mpd element 



Element or Attribute Name 


Use 


Description 


MPD 




The root element that carries the Media Presentation 
Description for a Media Presentation. 


@ici 





specifies an identifier for the Media Presentation. It is 
recommended to use an identifier that is unique within the 
scope in which the Media Presentation is published. 

If not specified, no MPD-lnternal Identifier Is provided. 
However, for example the URL to the MPD may be used as 
an identifier for the Media Presentation. 


©profiles 


M 


specifies a list of Media Presentation profiles as described in 
section 7.3. 

The contents of this attribute shall conform to either the 
pro- simple or pro- fancy productions of RFC6381, 
Section 4.5, without the enclosing dquote characters, i.e. 
including only the unencodedv or encodedv elements 
respectively. As profile identifier the URI defined for the 
conforming Media Presentation profiles as described in 7.3 
shall be used. 


©type 


OD 
default: 

static 


specifies whether the Media Presentation Description may 
be updated (@type=" dynamic) or not 
(@type=" static"). 

NOTE Static MPDs are typically used for On-Demand 
services, whereas dynamic MPDs are used for live services. 


(SavailabilityStartTime 


CM 

Must be 

present for 


For @type='dynamic' this attribute shall be present. In this 
case it specifies the anchor for the computation of the 
segment availability start time for any Segment in the Media 
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Element or Attribute Name 


Use 


Description 




type='Live' 


Presentation. 

For @type='static' , if present, it specifies the Segment 

availability start time for all Segments referred to in this 

MPD. If not present, all Segments described in the MPD 

shall be become available at the time the MPD becomes 

available. 


(SavailabilityEndTime 





specifies the latest Segment availability end time for any 
Segment in the Media Presentation. When not present, the 
value is unknown. 


(SmediaPresentationDuration 


CM 

Must be 

present for 

@type='stat 
ic' 


specifies the duration of the entire Media Presentation. 
If the attribute is not present, the duration of the Media 
Presentation is unknown. In this case the attribute 
MPDOminimumUpdatePeriod shall be present. 
This attribute shall be present when the attribute 
MPDOminimumUpdatePeriod is not present. 


(SminimumUpdatePeriod 





If this attribute is present, it specifies the smallest period 

between potential changes to the MPD. This can be useful 

to control the frequency at which a client checks for 

updates.. This can be useful to control the frequency at 

which a client checks for updates. 

If this attribute is not present it indicates that the MPD does 

not change. 

If MPD@type is "static", (SminimumUpdatePeriod 

shall not be present. 

Details on the use of the value of this attribute are specified 
in 8.5. 


OminBuf f erTime 


M 


specifies a common duration used in the definition of the 
Representation data rate (see obandwidth attribute in 
8.4.3.4). 


(StimeShiftBuf ferDepth 





specifies the duration of the time shifting buffer that is 
guaranteed to be available for a Media Presentation with 
type ' dynamic ' . When not present, the value is infinite. 
This value of the attribute is undefined if the ©type attribute 
is equal to "static". 


©suggestedPresentationDelay 





when ©type is 'dynamic', it specifies a fixed delay offset in 
time from the from the presentation time of each access unit 
that is suggested to be used for presentation of each access 
unit. For more details refer to 9.4.1 .2. When not specified, 
the no value is provided and the client is expected to choose 
a suitable value. 

when ©type is 'static' the value of the attribute is 
undefined and may be ignored. 


OmaxSegmentDuration 





specifies the maximum duration of any Segment in any 
Representation in the Media Presentation, i.e. documented 
in this MPD and any future update of the MPD. If not 
present, then the maximum Segment duration shall be the 
maximum duration of any Segment documented in this 
MPD. 


OmaxSubsegmentDuration 





specifies the maximum duration of any Media Subsegment 
in any Representation in the Media Presentation. If not 
present, the same value as for the maximum Segment 
duration is implied. 


Programlnformation 


... N 


specifies descriptive information about the program. For 
more details refer to the description in clause 8.6.2. 


BaseURL 


... N 


specifies a Base URL that can be used for reference 
resolution and alternative URL selection. For more details 
refer to the description in clause 8.7. 


x3gpp : Del taSupport 


... N 


If present, this element specifies that MPD delta files are 
supported by the server. For more details refer to the 
description in clause 8.5.2. 


Location 


... N 


specifies an absolute URL where the MPD is available. 
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Element or Attribute Name 


Use 


Description 


Period 


1 ... N 


specifies a Period. For more details refer to the description 
in clause 8.4.2. 


Metrics 


0... N 


specifies information about the requested QoE metrics. For 
more details refer to clause 10.3. 

At most one Metrics element shall be present in the IVIPD. 
NOTE: The schema allows more than one Metrics 
elements for potential future extensions. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally IVIandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @ 



Table 8-6: Syntax of mpd element 



< ! - - MPD Type - - > 

<xs : complexType ,■_ "MPDtype"> 

<xs : sequence> 

<xs:element "Programlnformation" t"^'"'=--"ProgramInformationType" minO'^'-'i" "0" 
maxOccurs= "unbounded" /> 

<xs:element ..,.,,^-- "BaseURL" "BaseUrlType" ,,.^nOccurs = "0" maxOccurp-"unbounded"/> 
<xs:element ;iame = "Location" "xs:anyURI" minOccurs="0" maxOccurs- "unbounded"/> 
<xs: element name = "Period" "PeriodType" iTiaxOccurs="unbounded"/> 
<xs:element name = "iyietrics" "MetricsType" minOccurs = "0" 

m = :^ n ,- ,- 1 n- = - Ti unbounded " / > 
<xs:element "x3gpp iDeltaSupport" type="DeltaSupportType" minOccurs="0"/> 

</xs : sequence> 

<xs :attribute name="id" type^ xs : string"/> 

<xs : attribute name="prof iles" "xs:string" ...j»_--"required"/> 

<xs : attribute name="type" --"PresentationType" def ault="static"/> 

<xs : attribute name="availabilityStartTime" type="xs :dateTime"/> 

<xs : attribute name="availabilityEndTime" type="xs :dateTime"/> 

<xs : attribute name="mediaPresentationDuration" tV|...---"xs : duration" /> 

<xs : attribute name="minimumUpdatePeriod" type="xs :duration"/> 

<xs :attribute name="minBuf f erTime" Ly;:'e = "xs :duration" ' " - "required"/> 

<xs : attribute name="timeShif tBuf f erDepth" ype="xs : duration" /> 

<xs : attribute name="suggestedPresentationDelay" type "xs :duration"/> 

<xs : attribute name="maxSegmentDuration" type="xs : duration" /> 

<xs : attribute name="maxSubsegmentDuration" type="xs : duration" /> 

<xs : anyAttribute namespar "##other" processContents "lax"/> 
</xs : complexType > 

<!-- Type of presentation - live or on-demand --> 
<xs : simpleType riair[e = "PresentationType" > 
<xs : restriction base="xs : string" > 

<xs : enumeration value ="static"/> 
<xs : enumeration value="dynamic"/> 
</xs : restriction> 
</xs : simpleType> 



8.4.2 Period 

A Media Presentation consists of one or more Periods. A Period is defined by Period element in the MPD element. 

The type of the Period, either a regular Period or an Early Available Period, as well as the PeriodStart time of a regular 
Period is determined as follows: 

o If the attribute ©start is present in the Period, then the Period is a regular Period and the PeriodStart is 
equal to the value of this attribute. 

o If the ©start attribute is absent, but the previous Period element contains a ©duration attribute then 
this new Period is also a regular Period. The start time of the new Period PeriodStari is the sum of the start 
time of the previous Period PeriodStart and the value of the attribute ©duration of the previous Period. 

o If (i) ©start attribute is absent, and (ii) the Period element is the first in the MPD, and (iii) the 
MPD©type is ' static ' , then the PeriodStart time shall be set to zero. 
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o If (i) ©start attribute is absent, and (ii) the previous Period element does not contains a ©duration 
attribute or the Period element is the first in the MPD, and (iii) the MPD@type is ' dynamic ' , then this 
Period is an Early Available Period (see below for details). 

For any regular Period the following holds: PeriodStart reflects the actual time that should elapse after playing the 
media of all prior Periods in this Media Presentation relative to the PeriodStart time of the first Period in the Media 
Presentation. The Period extends until the PeriodStart of the next Period, or until the end of the Media Presentation in 
the case of the last Period. More specifically, the difference between the PeriodStart time of a Period and either the 
PeriodStart time of the following Period, if this is not the last Period, or the value of the 

MPD@mediaPresentationDuration if this is the last one, is the presentation duration in Media Presentation time 
of the media content represented by the Representations in this Period. 

Early Available Periods may be used to advertise initialization of other non-media data before the media data itself is 
available. Period elements documenting early available Periods shall not occur before any Period element 
documenting a regular Period. For Early Available Periods, any resources that are announced in such a Period 
element shall be available. Such a Period element shall not contain URLs to Media Segments. The data contained in 
such a Period element does not represent a Period in the Media Presentation. Only when the PeriodStart time 
becomes known through an update of the MPD, such a Period element represents a regular Period. However, an 
update of the MPD may even remove a Period element representing an Early Available Period in later updates of the 
MPD as long as no PeriodStart time is associated with the Period. 

The attributes and elements contained in the Period element are provided in Table 8-7 along with their semantics. The 
XML syntax or the Period element is provided in Table 8-8. 

Table 8-7: Semantics of Period Element 



Element or Attribute Name 


Use 


Description 


Period 




specifies the information for a single Period. 


@xlink:href 





specifies a reference to an external Period element 


(Sxlink : actuate 




default: 

onRequest 


specifies the processing instructions, which can be either 
"onLoad" or "onRequest". 

This attribute must not be present if the ©xlink : href attribute 
is not present 


@id 


CM 

if 

"MPD@type= 

dynamic" 


specifies a unique identifier for this Period within the Media 
Presentation. 

If the MPDotype is equal to "dynamic", then this attribute shall 
be present and the @id of the Period shall remain unchanged 
over an MPD update. 

If not present, no identifier for the Period is provided. 


©start 





if present, specifies the Perioc/Sfarttime of the Period. 
The PeriodStart X\me is used as an anchor to determine the 
MPD start time of each Media Segment as well as to determine 
the presentation time of each each access unit in the Media 
Presentation timeline. 

If not present, refer to the details above in this clause. 


Oduration 





if present, specifies the duration of the Period to determine the 
PeriodStart X\me of the next Period. 

If not present, refer to the details above in this clause. 


(SbitstreamSwitching 


OD 

Default: 

false 


When set to "true", this is equivalent as if the 
AdaptationSetObitstreamSwitching for each Adaptation 
Set contained in this Period is set to 'true'. In this case, the 
AdaptationSetObitstreamSwitching attribute shall not be 
set to 'false' for any Adaptation Set in this Period. 


BaseURL 


0...N 


specifies a base URL that can be used for reference resolution 
and alternative URL selection. For more details refer to the 
description in clause 8.7. 
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Element or Attribute Name 


Use 


Description 


SegmentBase 


0...1 


specifies default Segment Base information. 
Information in this element is overridden by information in 
AdapationSet . SegmentBase and 
Representation. SegmentBase, if present. 

For more details see clause 8.4.4. 


SegmentList 


0...1 


specifies default Segment List information. 

Information in this element may be overridden by information in 
AdapationSet .SegmentList and 
Representation. SegmentList, if present. 

For more details see clause 8.4.4. 


SegmentTemplate 


0...1 


specifies default Segment Template information. 

Information in this element may be overridden by information in 
AdapationSet .SegmentTemplate and 
Representation. SegmentTemplate, if present. 

For more details see clause 8.4.4. 


AdaptationSet 


0...N 


specifies an Adaptation Set. 

At least one Adaptation Set shall be present in each Period. 
However, the actual element may be present only in a remote 
element if xlink is in use. 

For more details see clause 8.4.3.3. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 

Note that the conditions only holds without using xlink:href. If linking is used, then all attributes are "optional" and 
<minOccurs=0> 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 8-8: Syntax of Period Element 



Period of a presentation 



<xs : complexType 
<xs : sequence> 
<xs : element 
<xs : element 
<xs : element 



"PeriodType": 



" unbounded " / : 



= " " / = 



.5.me^"BaseURL" type = "BaseURLType" minOccurs="0" maxOccurf 
":s me ^"SegmentBase" type = "SegmentBaseType" minOccurs="0"/> 
I ame = " SegmentList" 'pe = "SegmentListType" minOccurs="0"/> 
<xs:element name = "SegmentTemplate" t'/oe; "SegmentTemplateType" minOccurs = 
<xs:element name; "AdaptationSet" "AdaptationSetType" minOccurs="0" 

maxOccurs= "unbounded" / > 
<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 
</xs : sequence> 

<xs : attribute :■ "xlink:href "/> 

<xs : attribute re^ "xlink: actuate" default -"onRequest"/> 
<xs :attribute name="id" type="xs : string"/> 
<xs :attribute name="start" type="xs : duration" /> 
<xs :attribute name=" duration" , pe="xs :duration"/> 
<xs :attribute name="bitStreamSwitching" type="xs :boolean" 
<xs : anyAttribute namespace="##other" processContents- "lax"/> 
</xs : complexType> 



iiefault = "false"/> 



8.4.3 Adaptation Sets and Representations 
8.4.3.1 Overview 

Periods are further subdivided as follows: 

o Each Period contains one or more groups. Groups consist of Adaptation Sets as described in clause 8.4.3.3. 

o In case an Adaptation Set contains multiple media content components, then each media content 
component is described individually as defined in clause 8.4.3.6. 
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o Each Adaptation Set contains one or more Representations as described in clause 8.4.3.4. 

o A Representation may contain one or more Sub-Representations as described in clause 8.4.3.5. 

o Adaptation Sets, Representations and Sub-Representations share common attributes and elements that are 
described in clause 8.4.3.2. 



8.4.3.2 



Common Attributes and Elements 



The elements AdaptationSet, Representation and SubRepresentation have assigned common 
attributes and elements. 

The attributes and elements listed in Table 6 may be present in all three elements. The semantics of these attributes are 
provided in Table 8-9. The XML-syntax is provided in Table 8-10. 

The 'Use' column in Table 8-9 shall be interpreted that an attribute marked with 'M' shall be available for a 
Representation, i.e. it shall either be present in the Representation element, or if not, it shall be in the containing 
AdaptationSet element. An attribute marked with 'O' may be absent in both. 

AdaptationSet element. An attribute marked with 'O' may be absent in both. 

Table 8-9: Common Adaptation Set, Representation and Sub-Representation and Attributes 

and Elements 



Element or Attribute Name 


Use 


Description 


Common attributes and elements 






©profiles 





specifies the profiles which the associated 
Representation(s) conform to the list of IVIedia 
Presentation profiles as described in 7.3. The value shall 
be a subset of the respective value in any higher level of 
the document hierarchy (Representation, Adaptation Set, 
IVIPD). 

If not present, the value is inferred to be the same as in 
the next higher level of the document hierarchy. For 
example, if the value is not present for a Representation, 
then ©profiles at the Adaptation Set level is valid for 
the Representation. 

The same syntax as defined in 8.4.1 shall be used. 


©width 





Specifies the horizontal visual presentation size of the 
video media type in pixel. 

If not present on any level, the value is unknown. 


©height 





Specifies the vertical visual presentation size of the video 
media type in pixel. 

If not present on any level, the value is unknown. 


@f rameRate 





specifies the output frame rate of the video media type in 
the Representation. If the frame rate is varying, the value 
is the average frame over the entire duration of the 
Representation. 

The value is coded as a string, either containing two 
integers separated by a "/", ("F/D"), or a single integer "F". 
The frame rate is the division F/D, or F, respectively, per 
second (i.e. the default value of D is '1 '). 

If not present on any level, the value is unknown. 


©audioSamplingRate 





Either a single decimal integer value specifying the 
sampling rate or a whitespace separated pair of decimal 
integer values specifying the minimum and maximum 
sampling rate of the audio media component type. The 
values are in samples per second. 

If not present on any level, the value is unknown. 
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Element or Attribute Name 


Use Description 


OmimeType 


M 


specifies the MIME type of the concatenation of the 
Initialization Segment, if present, and all consecutive 
Media Segments in the Representation. 


©codecs 


M 


specifies the codecs parameter specifying the media 
types. The codec parameters shall also include the profile 
and level information where applicable. 
The contents of this attribute shall conform to either the 
simp-list or fancy-list productions of RFC6381 [26] clause 
3.2, without the enclosing dquote characters. The codec 
identifier for the media format, mapped into the name 
space for codecs as specified in RFC6381 [26], clause 3.3 
shall be used. 


(SmaximumSAPPeriod 





when present, specifies the maximum SAP interval in 
seconds of all contained media streams, where the SAP 
interval is the maximum time interval between the Tsap of 
any two successive SAPs of types 1 to 3 inclusive of one 
media stream in the associated Representations. 

If not present on any level, the value is unknown. 


OstartWithSAP 





when present and greater than 0, specifies that in the 
associated Representations, each Media Segment starts 
with a SAP of type less than or equal to the value of this 
attribute value in each media stream. 

A Media Segment starts with a SAP in a media stream if 
the stream contains a SAP in that Media Segment, Isau is 
the index of the first access unit that follows Isap and Isap 
is contained in the Media Segment. 

If not present on any level, the value is unknown. 


OmaxPlayoutRate 





Specifies the maximum playout rate as a multiple of the 
regular playout rate, which is supported with the same 
decoder profile and level requirements as the normal 
playout rate. 

If not present on any level, the value is 1 . 


©codingDependency 





When present and "true", for all contained media 
streams, specifies that there is at least one access unit 
that depends on one or more other access units for 
decoding. When present and "false", for any media type, 
there is no access unit that depends on any other access 
unit for decoding (e.g. for video all the pictures are intra 
coded). When not present, there may or may not be 
coding dependency between access units. 


FramePacking 


0...N 


specifies frame-packing arrangement information of the 

video media component type. 

When no FramePacking element is provided for a video 

component, frame-packing shall not be used for the video 

media component. 

For details see 8.6.3.1 and 8.6.3.8. 


AudioChannelConf iguration 


... N 


specifies the audio channel configuration of the audio 
media component type. 

For details see clause 8.6.3.1 and 8.6.3.7. 


ContentProtection 


... N 


specifies information about the use of content protection 

for the associated Representations. 

For details, refer to clause 8.6.3.1 and 8.6.3.2. 


Legend: 

For attributes: M=Mandatory, 0=Optional. 
For elements: <minOccurs>..<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 
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Table 8-10: XML-Syntax of Common Group and Representation and Attributes and Elements 

<!-- RepresentationBase type; extended by other Representation-related types --> 
<xs : complexType "Represent at ionBaseType" abst ra.ct = "true" > 
<xs : sequence minOccurs="0" maxOccurs= "unbounded" > 

<xs: element name="FramePacking" type="DescriptorType" minOccurs="0" 
maxOccurs= "unbounded" /> 

<xs: element narne^"AudioChannelConf iguration" type = "DescriptorType" minOccurs="0" 
maxOc cur s = " unbounded " / > 

<xs: element name="ContentProtection" type="DescriptorType" minOccurs="0" 

maxOccurs= "unbounded" /> 
<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 
</xs : sequence> 

<xs :attribute name="prof lies" type="xs : string"/> 
<xs :attribute name="width" type="xs :unsignedInt"/> 
<xs :attribute name="height" type="xs :unsignedInt"/> 
<xs : attribute name="f rameRate" type="FrameRateType"/> 
<xs : attribute name="audioSamplingRate" type="xs : string"/> 
<xs : attribute name="mimeType" type="xs : string"/> 
<xs : attribute name="codecs" type="xs : string"/> 
<xs : attribute name="maximumSAPPeriod" type="xs :double"/> 
<xs :attribute name="startWithSAP" type="SAPType"/> 
<xs : attribute name="maxPlayoutRate" type="xs : double" /> 
<xs : attribute name="codingDependency" type="xs :boolean"/> 
<xs : anyAt tribute namespace ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Stream Access Point type enumeration --> 
<xs : simpleType name="SAPType" > 

<xs : restriction base="xs :unsignedlnt" > 
<xs iminlnclusive value="0"/> 
<xs :maxlnclusive value="6"/> 
</xs : restriction> 
</xs : simpleType> 

<!-- Type for Frame Rate --> 

<xs : simpleType name="FrameRateType"> 

<xs : restriction base="xs : string" > 

<xs ipattern J i.. .- " [0-9] * [0-9] {/ [0-9] * [0-9] ) ?"/> 

</xs : restriction> 
</xs : simpleType> 



8.4.3.3 Adaptation Set 

An Adaptation Set is described by an AdaptatlonSet element. AdaptatlonSet elements are contained in a 
Period element. An Adaptation Set contains alternate Representations, i.e. only one Representation within an 
Adaptation Set is expected to be presented at a time. All Representations contained in one Adaptation Set represent the 
same media content components and therefore contain media streams that are considered to be perceptually equivalent. 

Representations are arranged into Adaptation Sets according to their to the media content component properties of the 
media content components present in the Representations, namely 

o the language as described by the @lang attribute, 

o the media component type described by the ©contentType attribute, 

o the role property as described by the Role elements, 

o the accessibility property as described by the Accessibility elements, 

o the viewpoint property as described by the Viewpoint elements, 

o the rating property as described by the Rating elements. 

Representations shall appear in the same Adaptation Set if and only if they have identical values for all of these media 
content component properties for each media content component. 

The values for the elements Role, Accessibility, Viewpoint and Rating are typically not provided within 
this specification. However, a number of simple schemes are defined in Annex C. 
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If there exist multiple media content components then the properties of each media content component shall be 
described by a separate ContentComponent element as defined in 8.4.3.6. The ContentComponent element 
shares common elements and attributes with the AdaptatlonSet element. Default values or values applicable to all 
media content components may be provided directly in the AdaptationSet element. Attributes present in the 
AdaptatlonSet shall not be repeated in the ContentComponent element. 

The AdaptatlonSet element may contain default values for elements and attributes associated to the contained 
Representations. Any of the common attributes defined in clause 8.4.3.2 shall only be present either in the 
AdaptatlonSet element or in the Representation element, but not in both. 

The AdaptatlonSet element also supports the description of ranges for the ©bandwidth, ©width, ©height 
and @f rameRate attributes associated to the contained Representations, which provide a summary of all values for all 
the Representations within this Adaptation Set. The Representations associated with an AdaptatlonSet element 
shall not contain values outside the ranges documented for that Adaptation Set. 

Adaptation Sets maybe further arranged into groups using the ©group attribute. The semantics of this grouping is that 
the media content within one Period is represented by: 

1) either one Representation from group 0, if present, 

2) or the combination of at most one Representation from each non-zero group. 

If the AdaptatlonSet©group attribute is not present then all Representations in this Adaptation Set are assigned to 
a non-zero group specific to this Adaptation Set. 

The semantics of the attributes and elements within an AdaptatlonSet element are provided in Table 8-11. The 
XML-syntax of the AdaptatlonSet element is provided in Table 8-12. 

Table 8-11 : Semantics of AdaptatlonSet element 



Element or Attribute Name 


Use 


Description 


AdaptatlonSet 




Adaptation Set description 




@xlink:href 





specifies reference to a remote AdaptatlonSet element 




Oxlink : actuate 


OD 
default: 

'onRequest' 


specifies the processing instructions, which can be either 

"onLoad" or "onRequest". 




@id 





specifies unique identifier for this Adaptation Set within 
the Period. The attribute shall be unique in the scope of 
the containing Period. 

The attribute shall not be present in a remote element. 

If not present, no identifier for the Adaptation Set is 
specified. 




©group 





specifies an identifier for the group that is unique in the 
scope of the containing Period. 

For details refer to the description above in this clause. 




CommonAttributesElements 




specifies the common attributes and elements (attributes 
and elements from base type RepresentationBaseType) 
For details see clause 8.4.3.2. 




(Slang 





Declares the language code(s) for this Adaptation Set. 
The syntax and semantics according to IETF RFC 5646 
[13] shall be used. 

If not present, the language code may be defined for each 
media component or it may be unknown. 




©contentType 





specifies the media content component type for this 
Adaptation Set. A value of the top-level Content-type 
'type' value as defined in RFC1521 [27], section 4 shall be 
taken. 

If not present, the media content component type may be 
defined for each media component or it may be unknown. 
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Element or Attribute Name 


Use 


Description 




OminBandwidth 





specifies minimum bandwidtli value in all Representations 
in this Adaptation Set. This value has the same units as 
the Obandwidth attribute. 




©maxBandwidth 





specifies maximum bandwidth value in all 
Representations in this Adaptation Set. This value has the 
same units as the obandwidth attribute. 




OminWidth 





specifies minimum width value in all Representations in 
this Adaptation Set. This value has the same units as the 
owidth attribute. 

If not present, the value is unknown. 




OmaxWidth 





specifies maximum width value in all Representations in 

this Adaptation Set. 

This value has the same units as the ©width attribute. 

If not present, the value is unknown. 




OminHeight 





specifies minimum height value in all Representations in 

this Adaptation Set. 

This value has the same units as the ©height attribute. 

If not present, the value is unknown. 




OmaxHeight 





specifies maximum height value in all Representations in 

this Adaptation Set. 

This value has the same units as the ©height attribute. 

If not present, the value is unknown. 




(SminFrameRate 





specifies minimum frame rate value in all Representations 

in this Adaptation Set. 

This value is encoded in the same format as the 

@f rameRate attribute. 

If not present, the value is unknown. 




OmaxFrameRate 





specifies maximum frame rate value in all 
Representations in this Adaptation Set. 
This value is encoded in the same format as the 
@f rameRate attribute. 

If not present, the value is unknown. 




©segment Al ignment 





when not set to "false", this specifies that for any two 
Representations, X and Y, within the same Adaptation 
Set, the m-th Segment of X and the n-th Segment of Y are 
non-overlapping (as defined in section 9.2.5.2) whenever 
m is not equal to n. 

For Adaptation Sets containing Representations with 
multiple media content components, this attribute value 
shall be either ■ true' or 'false'. 

For Adaptation Sets containing Representations with a 
single media content component, when two 
AdaptationSet elements within a Period share the 
same integer value for this attribute, then for any two 
Representations, X and Y, within the union of the two 
Adaptation Sets, the m-th Segment of X and the n-th 
Segment of Y are non-overlapping (as defined in 9.2.5.2) 
whenever m is not equal to n. 




ObitStreamSwitching 





When this flag is set to "true", the following applies: 

• All Representations in the Adaptation Set shall 

have the same number Mof Media Segments; 

• Let Ri, R2, .... Rn be all the Representations within 

the Adaptation Set. 
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Element or Attribute Name 


Use 


Description 








• Let 

o Sij, for / > 0, be the f Media Segment in 
the /■"' Representation (i.e., Ri), and 

o Si,o be the Initialization Segment in the 
/*' Representation 

• The sequence of 

Sif1),0, Sif1),1, Sif2),2, ..., Sifk),k, ..., Si(M),M, 

wherein any i(k) for all k values in the range of 1 
to M, respectively, is an integer value in the 
range of 1 to A/, results in a "conforming 
Segment sequence" as defined in section 9.2.5.3 
with the media format as specified in the 
OmimeType attribute. 




©subsegmentAlignment 


OD 

default: 

false 


If the ©subsegmentAlignment for an Adaptation Set is 
set to other than "false", all following conditions shall be 
satisfied: 

• Each Media Segment shall be indexed (i.e. it 

contains a Segment index) 

• For any two Representations, X and Y, within the 

same Adaptation Set, the m-th Subsegment of X 
and the n-th Subsegment of Y are non- 
overlapping (as defined in section 9.2.5.2) 
whenever m is not equal to n. 

For Adaptation Sets containing Representations with a 
single media content component, when two 
AdaptationSet elements within a Period share the 
same integer value for this attribute, then for any two 
Representations, X and Y, within the union of the two 
Adaptation Sets, the m-th Subsegment of X and the n-th 
Subsegment of Y are non-overlapping (as defined in 
section 9.2.5.2) whenever m is not equal to n. 




Osubsegment Start sWithSAP 


OD 

default: 




when greater than 0, specifies that each Subsegment with 
SAP_type greater than starts with a SAP of type less 
than or equal to the value of 

osubsegmentstartswithSAP. A Subsegment starts 
with SAP when the Subsegment contains a SAP, and for 
the first SAP, Isau is the index of the first access unit that 
follows IsAP, and Isap is contained in the Subsegment. 

The semantics of OsubsegmentStartsWithSAP equal 
to are unspecified. 




Accessibility- 


... N 


specifies information about accessibility scheme 
For more details refer to section 8.6.3.1 and 8.6.3.6. 




Role 


... N 


specifies information on role annotation scheme 
For more details refer to section 8.6.3.1 and 8.6.3.3. 




Rating 


... N 


specifies information on rating scheme. 

For more details refer to section 8.6.3.1 and 8.6.3.4. 




Viewpoint 


... N 


specifies information on viewpoint annotation scheme. 
For more details refer to section 8.6.3.1 and 8.6.3.5. 
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Element or Attribute Name 


Use 


Description 




ContentComponent 


0...N 


specifies the properties of one media content component 
contained in this Adaptation Set. 

For more details refer to section 8.4.3.6. 




BaseURL 


0...N 


specifies a base URL that can be used for reference 
resolution and alternative URL selection. For more details 
refer to the description in section 8.7. 




SegmentBase 


0...1 


specifies default Segment Base information. 

Information in this element is overridden by information in 
the Representation. SegmentBase, if present. 

For more details see section 8.4.4. 




SegmentList 


0...1 


specifies default Segment List information. 

Information in this element is overridden by information in 
the Representation. SegmentList, if present. 

For more details see section 8.4.4. 




SegmentTemplate 


0...1 


specifies default Segment Template information. 

Information in this element is overridden by information in 
the Representation. SegmentTemplate, if present. 

For more details see section 8.4.4. 




Representation 


... N 


specifies a Representation. 

At least one Representation element shall be present in 
each Adaptation Set. The actual element may however be 
part of a remote element. 

See subclause 8.4.3.4. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory, F=Fixed. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 

Note that the conditions only holds without using xlink:href. If linking is used, then all attributes are "optional" and 
<minOccurs=0> 
Elements are bold; attributes are non-bold and preceded with an @, List of elements and attributes is in italics bold 



Table 8-12: XML-Syntax of AdaptationSet element 



<!-- Group to contain information common to an adaptation set; 

extends RepresentationBaseType --> 
<xs : complexType '' : "AdaptationSetType" > 
<xs : complexContent> 

<xs : extension "RepresentationBaseType" > 
<xs : sequence> 

<xs: element i--- "Accessibility" typ>e-"DescriptorType" minOccurs = "0" 
maxOc cur s = " unbounded " / > 

<xs: element ::_.iiiiL-^"Role" L\-pe = "DescriptorType" minOccurs = "0" 
maxOccur s = "unbounded " / > 

<xs: element name="Rating" type="DescriptorType" minOccurs="0" 
maxOccur s = "unbounded " / > 

<xs:element name^"Viewpoint" type=:"DescriptorType" minOccurs = "0" 
maxOccur s = "unbounded " / > 

<xs: element name ^"ContentComponent" type="ContentComponentType" minOccurs="0" 
maxOccurs= "unbounded" /> 

<xs: element name =" BaseURL" type="BaseURLType" minOccurs="0" 
maxOccurs= "unbounded" /> 

<xs: element name=" SegmentBase" type="SegmentBaseType" minOccurs="0"/> 

<xs: element name=" SegmentList" type="SegmentListType" minOccurs="0"/> 

<xs: element name=" SegmentTemplate" type="SegmentTemplateType" minOccurs="0"/> 

<xs:element name="Representation" type="RepresentationType" minOccurs="0" 

maxOccurs= "unbounded" /> 
<xs : any namespace="##other" processContents="lax" minOccurs="0" 
maxOccurs= "unbounded" /> 
</xs : sequence> 

<xs : attribute ref ="xlink:href "/> 

<xs :attribute ref ="xlink: actuate" def ault="onRequest"/> 
<xs : attribute name="id" type="xs : string"/> 
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<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name- 

<xs : attribute name 
default="false"/> 

<xs : attribute name- 

<xs : attribute name- 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 



7> 



=" group" type="xs :unsignedlnt "/> 

:"lang" type="xs : language"/> 

:"contentType" type="xs : string" /> 

:"minBandwidth" type="xs :unsignedlnt'' 

:"maxBandwidth" type="xs lunsignedlnt ' 

:"minWidth" type="xs :unsignedInt"/> 

="maxWidth" type="xs :unsignedInt"/> 

:"minHeight" type="xs :unsignedInt"/> 

:"maxHeight" type="xs :unsignedInt"/> 

: "minFrameRate " type= " FrameRateType " / > 

: "maxFrameRate " type= " FrameRateType " / > 

:"segmentAlignment" type =" Condi tionalUintType" def ault=" false" /> 

: " subsegment Alignment " type = " Condi tionalUintType " 

="subsegmentStartsWithSAP" type="SAPType" default="0"/> 
="bitStreamSwitching" type="xs : boolean" /> 



<!-- Conditional Unsigned Integer {unsignedint or boolean) 
<xs : simpleType name="ConditionalUintType" > 

<xs:union rnernberTypes = "xs : unsignedint xs :boolean"/> 
</xs : simpleType> 



8.4.3.4 Representation 

Representations are described by the Representation element. Representation elements are contained in an 
AdaptatlonSet element. 

A Representation is one of the alternative choices of the complete set or subset of media content components 
comprising the media content during the defined Period. 

A Representation starts at the start of the Period PeriodStart and continues to the end of the Period, i.e. the start of the 
next Period or the end of the Media Presentation. 

Each Representation includes one or more media streams, where each media stream is an encoded version of one media 
content component. 

A Representation consists of one or more Segments. 

Each Representation either shall contain an Initialization Segment or each Media Segment in the Representation shall 
be self-initializing, i.e. the Media Segment itself conforms to the media type as specified in the @mimeType attribute 
for this Representation. 

The concatenation of the Initialization Segment, if present, and all consecutive Media Segments in one Representation 
shall represent a conforming Segment sequence as defined in section 9.2.5.3 conforming to the media type as specified 
in the @mimeType attribute for this Representation. 

The semantics of the attributes and elements within a Representation are provided in Table 8-13. The XML-syntax of 
the Representation element is provided in Table 8-14. 

Table 8-13: Semantics of Representation element 



Element or Attribute Name 


Use 


Description 


Representation 




This element contains a description of a Representation. 


@id 


M 


specifies an dentifier for this Representation. The identifier 
shall be unique within a Period unless the Representation is 
functionally identically to another Representation in the 
same Period. 
The identifier shall not contain whitespace characters. 

If used in the template-based URL construction as defined in 
section 8.4.4.4, the string shall only contain characters that 
are permitted within an H 1 1 P-URL according to RFC 1738 
[19]. 


©bandwidth 


M 


Consider a hypothetical constant bitrate channel of 
bandwidth with the value of this attribute in bits per second 
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(bps). Then, if the Representation is continuously delivered 
at this bitrate, starting at any SAP that is indicated either by 
ostartwithSAP or by any Segment Index box, a client can 
be assured of having enough data for continuous playout 
providing playout begins after ominBuf f erTime * 
obandwidth bits have been received (i.e. at time 
OminBuf f erTime after the first bit is received). 



OqualityRanking 



o 



specifies a quality ranking of the Representation relative to 
other Representations in the Adaptation Set. Lower values 
represent higher quality content. If not present then the 
ranking is undefined. 



(SmediaStreamStructureld 



o 



The attribute may be present for Representations containing 
video and its semantics are unspecified for any other type of 
Representations. 

If present, the attribute OmediaStreamStructureld 
specifies a whitespace-separated list of media stream 
structure identifier values. If media streams share the same 
media stream structure identifier value, the media streams 
shall have the following characteristics: 

• The media streams have the same number of Stream 

Access Points of type 1 to 3. 

• The values of Tsap, Tdec, Tept, and Tptf of the /-th 

SAP of type 1 to 3 in one media stream are 
identical to the values of Tsap, Tdec, Tept, and Tptf, 
respectively, of the /-th SAP of type 1 to 3 in the 
other media streams for any value of /from 1 to the 
number of SAPs of type 1 to 3 in any of the media 
streams. 

• A media stream formed by concatenating the media 

stream of a first Representation until Isau 
(exclusive) of the ;-th SAP of type 1 to 3 and the 
media stream of a second Representation (having 
the same media stream structure identifier value as 
for the first Representation) starting from the Isau 
(inclusive) of the /-th SAP of type 1 to 3 conforms to 
the specification in which the media stream format 
is specified for any value of /from 1 to the number 
of SAPs of type 1 to 3 in either media stream. 
Furthermore, the decoded pictures have an 
acceptable quality regardless of type of the Stream 
Access Point access unit used. 

All media stream structure identifier values for one 
Adaptation Set shall differ from those of another Adaptation 
Set. 

If not present, then for this Representation no similarities to 
other Representations are known. 

Note: Indicating multiple media stream structure 
identifier values for a Representation can be useful in 
cases where switching between Representations A and 
B as well as between Representations B and C is 
allowed at non-IDR intra pictures, but switching between 
Representations A and C would cause too severe a 
degradation in the quality of the leading pictures and is 
hence not allowed. To indicate these permissions and 
restrictions. Representation A would contain 
OmediaStreamStructureld equal to '1', 
Representation B would contain 
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(SmediaStreamStructureld equal to '1 2', and 
Representation C would contain 
SmediaStreamStructureld equal to '2' 


CoiamonAttributesElements 




Common Attributes and Elements (attributes and elements 
from base type Representation BaseType), for more 
details see clause 8.4.3.2. 


BaseURL 


0...N 


specifies a Base URL that can be used for reference 
resolution and alternative URL selection. For more details 
refer to the description in section 8.7. 


SubRepresentation 


... N 


specifies rovides information about a sub-representation that 
is embedded in the containing Representation. For more 
details see clause 8.3.4.5. 


SegmentBase 


0...1 


specifies default Segment Base information. 
For more details see 8.4.4. 


SegmentList 


0... 1 


specifies the Segment List information. 
For more details see 8.4.4. 


SegmentTemplate 


0... 1 


specifies the Segment Template information. 
For more details see 8.4.4. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CIVI=Conditionally IVIandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @, List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-14: XML-Syntax of Representation element 



<!-- A Representation of the presentation content for a specific Period --> 
<xs : complexType ri.v -"Represent at ionType" > 
<xs : complexContent> 

<xs : extension base="RepresentationBaseType" > 
<xs : sequence> 

<xs:element iame="BaseURL" type="BaseURLType" minOccurs="0" 
maxOccur s = "unbounded " / > 

<xs : element iame="SubRepresentation" type="SubRepresentationType" 
minOccurs= " " /> 

<xs:element iame="SegmentBase" type="SegmentBaseType" minOccurs="0"/> 
<xs:element name=" SegmentList" Lype="SegmentListType" minOccurs="0"/> 
<xs:element name="SegmentTemplate" type="SegmentTemplateType" minOccurs="0"/> 
</xs : sequence> 

<xs :attribute name="id" type="StringNoWhitespaceType" use="required"/> 
<xs : attribute name="bandwidth" type="xs :unsignedlnt" use:- "required"/> 
<xs : attribute name="qualityRanking" type="xs :unsignedInt"/> 
<xs : attribute name="mediaStreamStructureId" ype^"StringVectorType"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- String without white spaces --> 

<xs : simpleType name="StringNoWhitespaceType" > 

<xs : restriction base="xs : string" > 

<xs:pattern - ". -^ - " [*\r\n\t \p{z}]*"/> 

</xs : restriction> 
</xs : simpleType> 

<!-- Whitespace-separated list of strings --> 
<xs : simpleType : ame = "StringVectorType"> 

<xs : list itemType="xs : string" /> 
</xs : simpleType> 



8.4.3.5 



Sub-Representation 



Sub-Representations are embedded in regular Representations and are described by the SubRepresentation 
element. SubRepresentation elements are contained in a Representation element. 

The SubRepresentation element describes properties of one or several media content components that are 
embedded in the Representation. It may for example describe the exact properties of an embedded audio component 
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(language, codec, etc.), an embedded sub-title (language) or it may describe some embedded lower quality video layer 
(e.g. some lower frame rate, etc.). 

Sub-Representations and Representation share some common attributes and elements. 

In case the ©level attribute is present in the SubRepresentatlon element, 

• Sub-Representations provide the ability for accessing a lower quality version of the Representation in which they 

are contained. In this case, Sub-Representations for example allow extracting the audio track in a multiplexed 
Representation or may allow for efficient fast-forward or rewind operations if provided with lower frame rate. 

• the Initialization Segment and/or the Media Segments shall provide sufficient information such that the data can 

be easily accessed through HTTP partial GET requests. The details on providing such information shall be 
defined by the media format in use. For media formats defined in this specification, the Subsegment Index as 
defined in section 9.2.3.3 shall be used. 

If the ©level attribute is absent, then the SubRepresentatlon element is solely used as a more detailed 
descriptor for media streams that are embedded in the Representation. 

The semantics of the attributes and elements within a Sub-Representation are provided in Table 8-15. The XML-syntax 
of the Sub-Representation element is provided in Table 8-16. 

Table 8-15: Semantics of SubRepresentatlon element 



Element or Attribute Name 


Use 


Description 


SubRepresentatlon 




This element specifies a Sub-Representation. 


(Slevel 





Specifies the sub-representation level. If ©level attribute is 
present a Subsegment Index as defined in section 9.2.3.3 
shall be available for each Media Segment in the containing 
Representation. 

If not present, then the SubRepresentatlon element is 
solely used to provide a more detailed description for media 
streams that are embedded in the Representation. 


OdependencyLevel 





specifies the set of Sub-Representations within this 
Representation that this Sub-Representation depends on in 
the decoding and/or presentation process as a whitespace- 
separated list of ©level values. 

If not present, the Sub-Representation can be decoded and 
presented independently of any other Representation. 


©bandwidth 





Identical to the ©bandwidth definition in Representation, 
but applied to this Sub-Representation. This attribute shall 
be present in case the ©level attribute is present. 


Ocontent Component 





if present, specifies the set of all media content components 
that are contained in this Sub-Representation as a 
whitespace-separated list of values of 
ContentComponent©id values. 

if not present, the Sub-Representation is not assigned to a 
media content component. 


CommonAttributesElements 




Common Adaptation Set, Representation and Sub- 
Representation attributes and elements (attributes and 
elements from base type Representation BaseType), for 
details see clause 8.3.4.2. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded witli an @, List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-16: XML-Syntax of SubRepresentatlon element 



<!-- SubRepresentatlon of the presentation content for a specific Period; 
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extends RepresentationBaseType --> 
<xs : complexType "SubRepresentationType" > 
<xs : complexContent> 

<xs : extension hase="RepresentationBaseType" > 

<xs : attribute name="level" type="xs :unsignedInt"/> 
<xs :attribute name="dependencyLevel" type="UIntVectorType"/> 
<xs :attribute name="bandwidth" type="xs :unsignedInt"/> 
<xs : attribute name=" content Component" type="StringVectorType"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- Type for space delimited list of strings --> 
<xs : simpleType name="UIntVectorType" > 

<xs : list itemType="xs : unsignedint " /> 
</xs : simpleType> 



8.4.3.6 



Content Component 



Each Adaptation Set contains one or more media content components. The properties of each media content component 
are described by a ContentComponent element or may be described directly on the AdaptatlonSet element if 
only one media content component is present in the Adaptation Set. ContentComponent elements are contained in 
an AdaptatlonSet element. 

The semantics of the attributes and elements within a ContentComponent element are provided in Table 8-17. The 
XML syntax of the ContentComponent element is provided in Table 8-18. 

Table 8-17 — Semantics of ContentComponent element 



Element or Attribute Name 


Use 


Description 


ContentComponent 




description of a content component 


@id 





specifies an identifier for this media component. The attribute 
shall be unique in the scope of the containing Adaptation Set. 


@lang 





same semantics as in Table 8-1 1 for ®lang attribute 


OcontentType 





same semantics as in Table 8-1 1 for ©contentType attribute 


©par 





same semantics as in Table 8-1 1 for ®par attribute. 


Accessibility- 


... N 


same semantics as in Table 8-1 1 for Accessibility element 


Role 


... N 


same semantics as in Table 8-1 1 for Role element 


Rating 


... N 


same semantics as in Table 8-1 1 for Rating element 


Viewpoint 


... N 


same semantics as in Table 8-1 1 for viewpoint element 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory, F=Fixed. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @, List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-18 — XML-Syntax of ContentComponent element 



<!-- Content Component --> 

<xs : complexType iame="ContentComponentType" > 

<xs : sequence> 

<xs:element name="Accessibility" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs:element name="Role" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs:element name="Rating" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs:element name= "Viewpoint" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 

</xs : sequence> 

<xs : attribute name="id" type="xs :unsignedInt"/> 

<xs : attribute :iame = "lang" tvDe = "xs : language"/> 
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<xs : attribute name="contentType" type="xs : string"/> 
<xs : anyAt tribute name space ="##other" processContents="lax"/> 
</xs : complexType> 



8.4.4 Segments and Segment Information 
8.4.4.1 General 

A Segment is the smallest addressable unit described by an MPD and has a defined format. Segment formats are 
defined in section 9. This clause defines the MPD information for Segments. 

Specifically, a Segment shall referenced by an HTTP-URL included in the MPD, where an HTTP-URL is defined as an 
<absolute-URI> according to RFC 3986 [17], Clause 4.3, with a fixed scheme of 'http :' or "https : ", possibly 
restricted by a byte range if a range attribute is provided together with the URL. The byte range shall be expressed as a 
byte -range -spec as defined in RFC 2616 [9], Clause 14.35.1. It is restricted to a single expression identifying a 
contiguous range of bytes. 

Each Segment referenced through an HTTP-URL in the MPD is associated with a Segment availability interval, i.e. a 
time window in wall-clock time at which the Segments can be accessed via the HTTP -URL. The Segment availability 
interval window is described by a Segment availability start time and a Segment availability end time. 

Representations are assigned Segment Information through the presence of the elements BaseURL, SegmentBase, 
SegmentTemplate and/or SegmentList. The Segment Information provides information on the location, 
availability and properties of all Segments contained in one Representation. Specifically, information on the presence 
and location of Initialization, Media, Index and Bitstream Switching Segments is provided. 

The elements SegmentBase, SegmentTemplate and SegmentList may be present in the Representation 
element itself In addition, to express default values, they may be present in the Period and AdaptatlonSet 
element. At each level at most one of the three, SegmentBase, SegmentTemplate and SegmentList shall be 
present. Further, if SegmentTemplate or SegmentList on one level of the hierarchy, then the other one shall not 
be present on any lower hierarchy level. 

SegmentBase, SegmentTemplate and SegmentList shall inherit attributes and elements from the same 
element on a higher level. If the same attribute or element is present on both levels, the one on the lower level shall take 
precedence over the one on the higher level. 

Several mechanisms are available to specify the Segment Information. Specifically, each Representation shall have 
assigned exactly one of the following choices to determine the Segment Information, either by direct presence in the 
Representation element or by inheritance from the higher levels: 

o one or more SegmentList elements - for syntax and semantics refer to section 8.4.4.2.3. 

o one SegmentTemplate element - for syntax and semantics refer to section 8.4.4.2.4. 

o one or more BaseURL elements, at most one SegmentBase element, and no SegmentTemplate or 
SegmentList element. The SegmentBase element is defined in section 8.4.4.2.2. 

All three elements SegmentBase, SegmentTemplate and SegmentList share common elements based on the 
SegmentBase element. Furthermore, SegmentTemplate and SegmentList share common attributes and 
elements. The common information is defined in section 8.4.4.2.2. 

The derivation and details of Initialization and Media Segment information based on the above information is provided 
in section 8.4.4.3. 
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8.4.4.2 



Segment Information Description 



8.4.4.2.1 



Segment base information 



The SegmentBase element contains information that is sufficient is only a single Media Segment is provided per 
Representation and the Media Segment URL is included in the BaseURL element. 

In case multiple Media Segments are present, either a SegmentList or a SegmentTemplate is used that share the 
multiple Segment base information as provided in Table 8-20. 

If the Representation contains more than one Media Segment, then the attribute ©duration shall be present. 
Segments described by the Segment base information are referenced by an HTTP-URL conforming to the type 
URLType as defined in Table 8-21. 

The semantics of the attributes and elements for the SegmentBase element and the Segment base information are 
provided in Table 8-19 and the multiple Segment base information in Table 8-20. The XML syntax of the Segment 
Base Information is provided in Table 8-22. 

Table 8-19 — Semantics of SegmentBase element and Segment Base Information type 



Element or Attribute Name 


Use 


Description 


SegmentBase 

Segment Base Information 




specifies Segment base element as well as the 
type for the Segment base information. 


©timescale 


O 


specifies the timescale in units per seconds to be 
used for the derivation of different real-time 
duration values in the Segment Information. 

If not present on any level, it shall be set to 1 . 

NOTE This may be any frequency but 
typically is the media clock frequency of one of 
the media streams (or a positive integer 
multiple thereof). 


©presentationTimeOf f set 


O 


specifies the presentation time offset of the 
Representation relative to the start of the Period. 

The value of the presentation time offset in 
seconds is the division of the value of this attribute 
and the value of the @timescale attribute. 

If not present on any level, the value of the 
presentation time offset is 0. 


OindexRange 


o 


specifies the byte range that contains the 
Segment Index in all Media Segments of the 
Representation. 

The byte range shall be expressed and formatted 
as a byte-range-spec as defined in RFC 2616 [9], 
Clause 14.35.1. It is restricted to a single 
expression identifying a contiguous range of 
bytes. 

If not present the value is unknown. 


OindexRangeExact 


OD 

default: 

"false" 


when set to 'true' specifies that for all Segments in 
the Representation, the data outside the prefix 
defined by oindexRange contains the data 
needed to access all access units of all media 
streams syntactically and semantically. 

This attribute shall not be present if 
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(SindexRange is absent. 


Initialization 


0... 1 


specifies the URL including a possible byte range 
for the Initialization Segment. 

For the type definition refer to Table 8-21 . 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-20 — Semantics of MultipleSegmentBaselnf ormation type 



Element or Attribute Name 


Use 


Description 


MultipleSegmentBaselnf ormation 




specifies multiple Segment base information. 


Oduration 





If present, specifies the constant approximate 
Segment duration. 

All Segments within this Representation element 
have the same duration unless it is the last 
Segment within the Period, which could be 
significantly shorter. 

The value of the duration in seconds is the 
division of the value of this attribute and the value 
of the otimescale attribute associated to the 
containing Representation. 

For more details refer to clause 8.4.4.4.3. 


(SstartNumber 





specifies the number of the first Media Segment in 
this Representation in the Period. 

For more details refer to clause 8.4.4.4.3. 


Segment Base Information 




specifies Segment base information. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-21 — Semantics of elements of type URLType 



Element or Attribute Name 


Use 


Description 


Element of type URLType 




defines an HTTP-URL 


OsourceURL 





specifies the source URL part and shall be formatted either 
as an <absolute-URi> according to RFC 3986, Clause 
4.3, with a fixed scheme of 'http' or "https" or as a 
<relative-ref > according to RFC 3986, Clause 4.2. 

If not present, then any BaseURL element is mapped to the 
@sourceURL attribute and the range attribute shall be 
present. 


Orange 





specifies the byte range restricting the above HTTP-URL. 
The byte range shall be expressed and formatted as a 
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byte-range-spec as defined in RFC 2616, Clause 
14.35.1 . It is restricted to a single expression identifying a 
contiguous range of bytes. 

If not present, the element refers to the entire resource 
referenced in the ©sourceURL attribute. 



Legend: 

For attributes: IVI=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-22 — XML-Syntax of Segment Base Information 



<!-- Segment information base --> 

<xs : complexType :ame="SegmentBaseType" > 

<xs : sequence> 

<xs:element name="Initialization" type="URLType" minOccurs="0"/> 

<xs:any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 

</xs : sequence> 

<xs : attribute name="timescale" type="xs :unsignedInt"/> 

<xs : attribute name="presentationTimeOf f set" type="xs :unsignedInt"/> 

<xs : attribute name="indexRange" type="xs : string"/> 

<xs : attribute name = "indexRangeExact" type="xs ■.boolean" def ault = "f alse"/> 

<xs : anyAttribute namespace="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Multiple Segment information base --> 
<xs : complexType name="MultipleSegmentBaseType" > 
<xs : complexContent> 

<xs : extension base="SegmentBaseType" > 
<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 
</xs : sequence> 

<xs : attribute iame=" duration" type="xs :unsignedInt"/> 
<xs : attribute iame="startNumber" type="xs :unsignedInt"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- Segment Info item URL/range --> 
<xs : complexType ".ame = "URLType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 

</xs : sequence> 

<xs : attribute name="sourceURL" type="xs : anyURI"/> 

<xs : attribute name="range" type="xs : string"/> 

<xs : anyAttribute iamespace="##other" processContents="lax"/> 
</xs : complexType> 



8.4.4.2.2 



Segment list 



The Segment list is defined by one or more SegmentList elements. Each SegmentLlst element itself contains a 
list of SegmentURL elements for a consecutive list of Segment URLs. Each Segment URL may contain the Media 
Segment URL and possibly a byte range. The Segment URL element may also contain an Index Segment. 

The semantics of the attributes and elements for the Segment list are provided in Table 8-23. The XML syntax of the 
Segment list is provided in Table 8-24. 

Table 8-23 — Semantics of SegmentLlst element 



Element or Attribute Name 


Use 


Description 


SegmentLlst 




specifies Segment information. 


@xlink:href 





specifies a reference to remote SegmentLlst 
element 
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Oxlink : actuate 


OD 

default: 
"onRequest" 


specifies the processing set, can be either 

"onLoad" or "onRequest" 


MultipleSegmentBaselnfona 
ation 




Multiple Segment base information as defined in 
Table 8-20. 


SegmentURL 


... N 


specifies a iVIedia Segment URL and a possibly 
present Index Segment URL 


©media 





in combination with the ©mediaRange attribute 
specifies the HTTP-URL for the Media Segment. 

It shall be formatted as an <absolute-URi> 
according to RFC 3986, Clause 4.3, with a fixed 
scheme of 'http' or "https" or as a <relative- 
ref > according to RFC 3986, Clause 4.2. 

If not present, then any BaseURL element is 
mapped to the ®media attribute and the range 
attribute shall be present. 


©mediaRange 





specifies the byte range within the resource 
identified by the ©media corresponding to the 
Media Segment. 

The byte range shall be expressed and formatted 
as a byte-range-spec as defined in RFC 2616, 
Clause 14.35.1. It is restricted to a single 
expression identifying a contiguous range of 
bytes. 

If not present, the Media Segment is the entire 
resource referenced by the ©media attribute. 


OindexRange 





specifies the byte range of the Segment Index in 
Media Segment. 

The byte range shall be expressed and formatted 
as a byte-range-spec as defined in RFC 2616, 
Clause 14.35.1. It is restricted to a single 
expression identifying a contiguous range of 
bytes. 

If not present, the Index Segment then no Index 
Segment information is provided for this Media 
Segment. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CIVI=Conditionally IVIandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 

Note that the conditions only holds without using @xlink:href . If linl<ing is used, then all attributes are "optional" 

and <minOccurs=0> 
Elements are bold; attributes are non-bold and preceded with an @. List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-24 — XML-Syntax of segmentList element 



<!-- Segment List --> 

<xs : complexType name="SegmentListType" > 
<xs : complexContent> 

<xs : extension base="MultipleSegmentBaseType" > 
<xs : sequence> 

<xs:element name =" SegmentURL" L ype="SegmentURLType" minOccurs="0" maxOccurs="unbounded"/> 
</xs : sequence> 

<xs : attribute ref ="xlink:href "/> 
<xs : attribute ref ="xlink: actuate" def ault="onRequest"/> 
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</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- Segment URL --> 

<xs : complexType riame = "SegmentURLType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 

</xs : sequence> 

<xs : attribute name="media" type="xs : anyURI"/> 

<xs : attribute name="mediaRange" type="xs : string" /> 

<xs : attribute nanie = "indexRange" type="xs : string" /> 

<xs : anyAt tribute iamespace="##other" processContents="lax"/> 
</xs : complexType> 



8.4.4.2.3 



Segment template 



The Segment template is defined by the SegmentTemplate element. In this case, specific identifiers that are 
substituted by dynamic values assigned to Segments, to create a list of Segments. The substitution rules are provided in 
section 8.4.4.4. 

The semantics of the attributes and elements for the Segment list are provided in Table 8-25. The XML syntax of the 
Segment Information is provided in Table 8-26. 

Table 8-25 — Semantics of SegmentTemplate element 



Element or Attribute Name 


Use 


Description 


SegmentTemplate 




specifies Segment template information. 


MultipleSegmentBaselnformation 




Provides the Multiple Segment base information as 
defined in Table 8-20. 


Omedia 





specifies the template to create the Media Segment 
List. 

For more details refer to clause 8.4.4.3.3. 


Oinitialization 





specifies the template to create the Initialization 
Segment. The $Number$ identifier shall not be 
included. 

For more details refer to clause 8.4.4.3.2. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 

For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. List of elements and attributes is in italics bold 
referring to those taken from the base type that has been extended by this type. 



Table 8-26 — XML-Syntax of SegmentTemplate element 



<!-- Segment Template --> 

<xs : complexType : =ri.v^ "SegmentTemplateType" > 
<xs : complexContent> 

<xs : extension base="MultipleSegmentBaseType" > 
<xs : attribute name="media" type="xs : string"/> 
<xs : attribute iame="initialization" type="xs : string"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 



8.4.4.3 



Segment Information 



8.4.4.3.1 Overview 

The Segment Information provides the following information: 
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the presence or absence of Initialization Segment information 

the HTTP-URL and possibly a byte range for each accessible Segment in each Representation, 

all valid Segment URLs declared by the containing MPD 

for services with MPD@type= ' dynamic ' , the Segment availability start time and Segment availability end 
time of each Segment,, 

the approximate Media Presentation start time of a Media Segment in the Media Presentation timeline within the 
Period, 

The derivation of Initialization and Media Segment Information is provided in subclause 8.4.4.3.2 and 8.4.4.3.3, 
respectively. Reference resolution as defined in section 8.7.2 and base URL selection as defined in section 8.7.3 using 
BaseURL elements as defined in section 8.7.1 shall be applied to any URLs. 

8.4.4.3.2 Initialization Segment Information 

Each Representation has assigned at most one Initialization Segment. 

The presence of an Initialization Segment is indicated by the presence of SegmentBase . Initialization, 
SegmentLlst . Initialization, the SegmentTemplate. Initialization element or the 
SeginentTemplate@initialization attribute that may contain URL and byte range information or URL 
construction rules for the Initialization Segment. 

If neither Initialization element nor SegmentTemplate@initialization are present for a 
Representation then each Media Segment within the Representation shall be self-initializing. 

For services with MPD@type= ' dynamic ' , the Segment availability start time of the Initialization Segment is the 
sum of the value of the MPD@availabilityStartTime and PeriodStart time and the Segment availability end 
time of the Initialization Segment is the largest Segment availability end time of any Media Segment in this 
Representation. For Segment availability for media Segments refer to clause 8.4.4.3.3. 

The data structures retrieved from the Initialization URL are defined in section 9.2.2. 

8.4.4.3.3 Media Segment Information 

Each Representation has assigned a list of consecutive Media Segments. Each entry in the list of a media Segment has 
assigned the following parameters: 

o a valid Media Segment URL and possibly a byte range. 

o the number of the Media Segment in the Representation. 

o the MPD start time of the Media Segment in the Representation providing an approximate presentation 
start time of the Segment 

o MPD duration of the Media Segment providing an approximate presentation duration of the Segment 

These parameters are specified by the SegmentTemplate or SegmentLlst elements. To obtain at least one entry 
in the list of Media Segments, one of the following shall apply: 

o if SegmentTemplate element is present the Template-based Segment URL construction in section 
8.4.4.4 shall be applied with the number of the Media Segment in the Media Segment list. The first 
number in the list is determined by the value of the SegmentTemplate@startNumber attribute, 
if present, or is 1 in case this attribute is not present. 

o if one or more SegmentLlst elements are present they contain itself a list of SegmentURL 

elements for a consecutive list of Media Segment URLs. The first number in the list is determined by 
the value of the SegmentLlst@startNumber attribute, if present, or is 1 in case this attribute is 
not present. The sequence of multiple SegmentLlst elements within a Representation shall result 
in Media Segment List with consecutive numbers. 
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o none of the above: In this case only a single Media Segment shall be present with the URL provided 
by a BaseURL element and the SegmentBase element may be present. 

The MPD start time is relative to the start of the Representation provided by the MPD. The MPD start time and the 
MFD duration are approximate and do not reflect the exact Media Presentation time. For more details on the relation of 
MPD start times and Media Presentation time refer to section 9.4.1.2. 

For the derivation of the MPD start time and duration of each Media Segment, the Index of the Media Segment and the 
following information are used 

If the ©duration attribute is not present, then the Representation shall contain exactly one Media 
Segment. The MPD start time is and the MPD duration is obtained in the same way as for the last 
Media Segment in the Representation (see below for more details). 

If @durat ion attribute is present, then the MPD start time of the Media Segment is determined as 
(Number— Numberstart -1) times the value of the duration of the attribute ©duration with 
Numbergf^n the value of the ©startNumber attribute. The MPD duration of the Media Segment is 
the value of the attribute ©duration except for the duration of the last Media Segment (see below 
for more details). 

To determine the duration of the only or the last Media Segment of any Representation in a Period, the 
MPD shall include sufficient information to determine the duration of the containing Period. For 
example, the MPD@mediaPresentationDuration, or add Period@duration, or next 
Period@start may be present. 

For services with MPD@type= ' dynamic ' , the Segment availability start time of a Media Segment is the sum of the 
value of the MPDeavailabilityStartTime, the PeriodStart time of the containing Period as defined in section 
8.4.2, the MPD start time and the MPD duration of the Media Segment in the Representation. The Segment availability 
end time of a Media Segment is the sum of the Segment availability start time, the MPD duration of the Media Segment 
and the value of the attribute MPD@timeShif tBuf f erDepth. 

The MPD shall include URL information for all Segments with an availability start time less than both the end of the 
presentation and the sum of the latest time at which this version of the MPD is available on the server and the 

MPD@minimumUpdatePeriod. 

The data structures retrieved from the URL referring to a Media Segment are defined in section 9.2.3. 

8.4.4.4 Template-based Segment URL Construction 

The SegmentTemplate@Tnedia attribute and the SegmentTemplate@index each contain a string that may 
contain one or more of the identifiers as listed in Table 8-27. 

In each URL, the identifiers from Table 8-27 shall be replaced by the substitution parameter defined in Table 17. 
Identifier matching is case-sensitive. If the URL contains unescaped $ symbols which do not enclose a valid identifier 
then the result of URL formation is undefined. In this case it is expected that the DASH Client ignores the entire 
containing Representation element and the processing of the MPD continues as if this Representation 
element was not present. The format of the identifier is also specified in Table 8-27. 

Each identfier may be suffixed, within the enclosing "$" characters, with an additional format tag aligned with the 
printf format tag as defined in IEEE 1003.1-2008 [s] following this prototype: 

%0 [width] d 

The width parameter is an unsigned integer that provides the minimum number of characters to be printed. If the value 
to be printed is shorter than this number, the result shall be padded with zeros. The value is not truncated even if the 
result is larger. 

The Media Presentation shall be authored such that the application of the substitution process results in valid Segment 
URLs. 

Strings outside identifiers shall only contain characters that permit to form a valid HTTP-URL according to 
RFC 1738 [19]. 
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Table 8-27: Identifiers for Segment Templates 



$<ldentifier>$ 


Substitution parameter 


Format 


$$ 


Is an escape sequence, i.e. "$$" is replaced with a 
single "$" 


not applicable 


$RepresentationID$ 


This identifier is substituted by the attribute 
Representationoid of the containing 
Representation. 


The format tag shall not be present. 


$Number$ 


This identifier is substituted by the Number o^ the 
corresponding Segment. 


The format tag may be present. 

If no format tag is present, a default 
format tag with width=1 shall be used. 



8.5 MPD Update 
8.5.1 General 

If the MPD@type is set to 'dynamic', the MPD may be updated during the Media Presentation. Updates typically 
extend the accessible Segment list for each Representation, introduce a new Period or terminate the Media Presentation. 

In this case the MPD shall be made accessible at all locations specified in any present MPD. Location element or, if 
none is present, at the same location as the initial MPD. If the client fetches the MPD using HTTP, the client should use 
conditional GET methods as specified in RFC 2616 [9], clause 9.3 to reduce unnecessary network usage in the 
downlink. 

When the MPD is updated 

• the value of MPD@id, if present, shall be the same in the original and the updated MPD; 

• the values of any Period@id attributes shall be the same in the original and the updated MPD, unless the 

containing Period element has been removed. 

• the values of any AdaptationSet@id attributes shall be the same in the original and the updated MPD 

unless the containing Period element has been removed. 

• any Representation with the same @id and within the same Period as a Representation appearing in the 

previous MPD shall provide functionally equivalent attributes and elements, and shall provide functionally 
identical Segments with the same indices in the corresponding Representation in the new MPD. 

If the attribute MPD@minimuTnUpdatePeriod is not present, no update to the MPD is expected, the attribute 
MPD@mediaPresentationDuration shall be present and the MPD shall remain valid until the Media 
Presentation end time. 

If the attribute MPD@minimuTnUpdatePeriod is present, updates to the MPD are expected and restricted in a sense 
that at the location where the MPD is available at a certain time, the MPD is also valid for the duration of the value of 
the MPD@minimuinUpdatePeriod attribute. Specifically the following shall hold: . 

If the /-th version of the MPD is the last version of MPD till the end of the Media Presentation, let Texp(i) be the Media 
Presentation end time. Otherwise, let Texp(i) be the sum of the value of MPD@minimumUpdatePeriod and the wall- 
clock time at which the /-th version of the MPD is updated (and replaced with the (/H-l)-th version). The /-th MPD shall 
remain valid until Texp(i) in the following sense: 

• all Segments with availability start time less than Texp(i) shall be available at their availability start times at the 

location advertised in the /-th MPD. 

• all Representations have a Segment with an availability start time, Tavail, which is less than Texp(i) and with 

duration not less than (Texp(i) - Tavail). The actual duration of this Segment is not known by the client until 
this Segment or the next update of MPD is fetched and this duration may be less than the normal Segment 
duration if it is the last Segment of the Representation in this Period. 
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NOTE: 

1) the actual duration of this Segment is not known at the chent until this Segment or the updated MPD is fetched 
and this Segment duration may be less than the previous Segment duration if it is the last Segment in the Period. 

2) The clients may not know Texpii), but they can each calculate a lower bound on Texp(i) by adding 
MPD@minimumUpdatePeriod to the wall-clock time at which they request the MPD. 

3) The second condition above ensures that sufficient media is contained in each Representation to present up to 
media presentation time Texp(i) for a client that begins playing each Segment at the earliest possible time (its 
availability start time). 

4) The result of the MPD validity requirement is that all items a client expects to be able to retrieve (both Segments 
and MPD elements) are guaranteed to be available for retrieval during the periods that the client can expect them 
to be accessible. 

5) An MPD may contain no Period element or only an early available Period may be provided. In this case, updates 
to the MPD are expected in order to provide the start time of the first Period, which coincides with the start of 
the actual Media Presentation. 

8.5.2 Media Presentation Description Delta 

If the x3gpp : Del taSupport element is present in the MPD element, the content provider indicates that MPD delta 
files, as defined in this clause, are supported on the server. The URI of the MPD delta is provided in 
x3gpp: Del taSupport ©sourceURL. The x3gpp:DeltaSupport ©availabilityDuration element, if 
present, indicates that the MPD delta file referenced by the URI is available for at least the value of the 
©availabilityDuration attribute (after this time, the server may redirect the client to the full MPD). If 
xSgpp : DeltaSupport ©availabilityDuration is not present, then no information is conveyed about the 
availability of the MPD delta. If a client request for an MPD delta file results in an error, the client should request a full 
MPD. 

The semantics of the attributes within the x3gpp: DeltaSupport element are provided in Table 8-28. The XML- 
syntax of xSgpp: DeltaSupport element is provided in Table 8-29. 

Table 8-28: Semantics of x3gpp: DeltaSupport element 



Element or Attribute Name 


Use 


Description 


xSgpp : DeltaSupport 




If present, this element indicates that MPD delta files are 
supported by the server. 


©sourceURL 


M 


The source string providing the URL of the MPD delta. The 
URL may be relative to any BaseURL on MPD level and 
reference resolution according to clause 8.2.3 shall be 
applied. 


©availabilityDuration 





When provided, indicates the duration that the server 
guarantees the availability of the MPD delta file referenced in 
©sourceURL after the MPD has been updated. After that 
the client may be redirected to the full MPD. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CIVI=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 8-29: XML-Syntax of xSgpp: DeltaSupport element 



<! --DeltaSupport for the MPD --> 

<xs : complexType H.riie= "DeltaSupportType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 

</xs : sequence> 

<xs : attribute name="sourceURL" type="xs : anyURI" use="required"/> 

<xs : attribute name=" aval labilityDurat ion" type="xs : duration" /> 

<xs : anyAt tribute namespace="##other" processContents="lax"/> 
</xs : complexType> 
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An MPD delta is a text file that shall include the delta between the MPD that references it and the latest provided MPD. 
Note that the value of @sourceURL in successive MPDs is necessarily different because it is impossible for the delta 
between two different MPDs and the most recent MPD to be the same. 

The output format consists of one or more structures, each corresponding to a change. The changes are in decreasing 
line number order. The structure format looks like: 

change - command 
to-f ile-line 
to-f ile-line . . . 

There are three types of change commands change - command. Each consists of a line number or comma-separated 
range of lines in the first file and a single character indicating the kind of change to make. All line numbers are the 
original line numbers in the file. The types of change commands and the instructions are provided in Table 8-30. 

Table 8-30: Change commands and the instructions for delta MPD files 



Change 
command 


Instruction 


Example 


la 


Add text from the second file after line I in the first file. 


"8a" means to add the following lines 
after line 8 of file 1 


re 


Replace the lines in range r in the first file with the following 
lines. Like a combined add and delete, but more compact. 


"5 , 7c" means change lines 5-7 of file 1 
to read as the text file 2. 


rd 


Delete the lines in range r from the first file. 


"5 , 7d" means delete lines 5-7 of file 1 . 


NOTE: This is tlie format supported by the GNU diff utilities, see 
http://www.gnu.0rg/software/diffutils/manual/#Detailed-ed 



Regardless of the presence of a x3gpp:DeltaSupport element, the full MPD shall always be available to clients 
for regular MPD updates as defined in clause 8.5.1. MPD Delta related procedures are optional at the client. 



8.6 



Additional IVIedia Presentation Information 



8.6.1 



Introduction 



The MPD, Periods, Adaptation Sets, Representations and Sub-Representations may have assigned descriptors for 
describing the content or other elements in the MPD. This clause specifies this descriptive information. 



8.6.2 Program Information 



Descriptive information on the program may be provided for each period within the Programlnformation 
element. 

When multiple Programlnformation elements are present, the @lang attribute shall be present and each element 
shall describe the Media Presentation sufficiently in the language defined by the value of the @lang attribute. 

For each language, the program information may specify title, source of the program, copyright information and a URL 
to more information. 

The semantics of the attributes within the Programlnformation element are provided in Table 8-31. The XML- 
syntax of Programlnformation element is provided in Table 8-32. 

Table 8-31 : Semantics of Programlnformation element 



Element or Attribute Name 


Use 


Description 


Programlnformation 




specifies descriptive information about the program 


(Slang 





Declares the language code(s) for this Program Information. 
The syntax and semantics according to IETF RFC 5646 [13] 
shall be applied. 
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If not present the value is unknown. 


©moreinf ormationURL 





If specified, this attribute contains an absolute URL which 
provides more information about the Media Presentation in 
this Period. 

If not present the value is unknown. 


Title 


0...1 


specifies a title for the Media Presentation 


Source 


0...1 


specifies information about the original source (for example 
content provider) of the Media Presentation. 


Copyright 


0...1 


specifies a copyright statement for the Media Presentation. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 8-32: XML-Syntax of Programinf ormation element 



<!-- Program information for a presentation --> 
<xs : complexType name="ProgramInformationType" > 
<xs : sequence> 

<xs:element name^"Title" i^ji-.v,-"xs : string" "iinOccurs = "0"/> 
<xs:element name="Source" type="xs : string" linOccurs; "0"/> 
<xs:element name = " Copyright" type = "xs : string" minOccuj. j-"0"/> 

<xs : any namespace^ "##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded"/; 
</xs : sequence> 

<xs :attribute name="lang" type="xs : language"/> 
<xs : attribute name="moreInformationURL" type^"xs :anyURI"/> 
<xs :anyAt tribute namespace ="##other" processContents^ "lax"/> 
</xs : complexType> 



8.6.3 Descriptors 
8.6.3.1 General 

The MFD may contain descriptors that are all in the same format as defined in this clause. The elements of type 
DescriptorType provide a flexible mechanism for DASH content authors to annotate and extend the MPD, 
Period, AdaptationSet and Representation elements. 

The description elements are all structured in the same way, namely they contain a @schemeIdUri attribute to 
identify the scheme and an optional attribute ©value. The ©scheme IdUri provides a URI to identify the scheme. 
The semantics of this element is specific to the scheme employed. The scheme may be a URN or a URL. 

The MPD does not provide any specific information on how to use these elements. It is up to the application that 
employs DASH formats to instantiate the description elements with appropriate scheme information. Some specific 
schemes are defined in Annex C. 

DASH applications that use one of these elements must first define a Scheme Identifier in the form of a URI and must 
then define the value space for the element when that Scheme Identifier is used. The Scheme Identifier appears in the 

©scheme I dUri attribute. 

In the case that a simple set of enumerated values are required, a text string may be defined for each value and this 
string must be included in the ©value attribute. If structured data is required then any extension element or attribute 
may be defined, but in a separate namespace. 

Two elements of type DescriptorType are equivalent, if the element name, the ©schemeldUri and the ©value 
are equivalent. If the ©schemeldUri is a URN, then equivalence refers to lexical equivalence as defined in clause 5 
of RFC 2141. If the ©schemeldUri is a URI, then equivalence refers to equality on a character-for-character basis as 
defined in clause 6.2.1 of RFC 3986 [17]. For the ©value XML-string matching shall be used for determining 
equivalence. If the ©value attribute is not present, equivalence is determined by the equivalence for ©schemeldUri 
only. 

The semantics of the attributes within a Generic Descriptor element are provided in Table 8-33. The XML-syntax of a 
Generic Descriptor element is provided in Table 8-34. The specific descriptors follow these syntax and semantics. 
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Table 8-33: Semantics of generic Descriptor element 



Element or Attribute Name 


Use 


Description 


Element of type 

DescriptorType 




This element provides information about the use of 
description. 


©schemeldUri 


M 


Provides a URI to identify the scheme. The definition of this 
element is specific to the scheme employed for content 
description. The URI may be a URN or a URL. The 
oschemeiduri may be a URN or URL. When a URL is 
used, it should also contain a month-date in the form 
mmyyyy; the assignment of the URL must have been 
authorized by the owner of the domain name in that URL on 
or very close to that date, to avoid problems when domain 
names change ownership 


©value 





This attribute provides the value for the descriptor element. 
The value space and semantics must be defined by the 
owners of the scheme identified in the oschemeiduri 
attribute. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally IVIandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 8-34: XML-Syntax of generic Descriptor element 



<!-- Generic named descriptive information --> 

<xs : complexType name =" DescriptorType" > 
<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" 
</xs : sequence> 

<xs : attribute name="schemeIdUri" type="xs : anyURI" use="required"/> 
<xs : attribute name="value" type="xs : string" use="optional"/> 
<xs : anyAt tribute name space ="##other" processContents="lax"/> 

</xs : complexType> 



maxOccur s = "unbounded " / ; 



8.6.3.2 



Content Protection 



For the element ContentProtection the ©schemeldUri attribute is used to identify the content protection 
schemes employed. This attribute should provide sufficient information, possibly in conjunction with the ©value 
and/or extension attributes and elements, such as the DRM system(s), encryption algorithm(s), and key distribution 
scheme(s) employed, to enable a client to determine whether it can possibly play the protected content. The 
ContentProtection element can be extended in a separate namespace to provide information specific to the 
content protection scheme (e.g. particular key management systems or encryption methods). Scheme-specific 
information can also be provided in the Initialization Segment(s) using the appropriate file format primitives instead of, 
or in addition to, the ContentProtection element. The client may have to receive and analyze the protected 
content (typically only the Initialization Segment, if present), before it can determine whether it has already acquired a 
license and/or key for accessing the protected content, or to determine from where it can acquire a missing license 
and/or key, in case this information is not available from the ContentProtection element. 

When the ContentProtection element is not present the content shall neither be encrypted nor content protected. 

When multiple ContentProtection elements are present, each element shall describe a content protection scheme 
that is sufficient to access and present the Representation. 



8.6.3.3 



Role 



For the element Role the ©schemeldUri attribute is used to identify the role scheme employed to identify the role 
of the media component. Roles define and describe characteristics and/or structural functions of media components. 

One Adaptation Set or one media content component may have assigned multiple roles even within the same scheme. 

This specification defines a role scheme in Annex C.2. 
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8.6.3.4 Rating 

For the element Rating the ©scheme IdUri attribute is used to identify the rating scheme employed. 

Ratings specifies that content is suitable for presentation to audiences for which that rating is known to be appropriate, 
or for unrestricted audiences. 

NOTE: An audience with a rating restriction is intended to not be presented content that has associated ratings, 
unless at least one scheme is recognized as indicating that the content is appropriate to that audience. 

8.6.3.5 Viewpoint 

For the element Viewpoint the ©scheme IdUri attribute is used to identify the viewpoint scheme employed. 

Adaptation Sets containing non-equivalent Viewpoint contain different media content components. The 
Viewpoint elements may equally be applied to media content types that are not video. . 

Adaptation Sets with equivalent Viewpoint element values are intended to be presented together. This handling 
should be applied equally for recognised and unrecognised ©schemeldUri values. 

8.6.3.6 Accessibility 

For the element Accessibility the ©schemeldUri attribute is used to identify the accessibility scheme 
employed. Accessibility is a general term used to describe the degree to which the DASH Media Presentation is 
available to as many people as possible. 

NOTE Accessibility elements fulfil a very similar purpose with respect to media content components 
as for Role elements, but are specifically intended for accessibility. 

One Adaptation Set or one media content component may have assigned multiple accessibility purposes even within the 
same scheme. 

This specification does not define a specific accessibility scheme, but the simple role scheme in may be used to express 
a minimum amount of accessibility information. 

8.6.3.7 Audio channel configuration 

For the element AudioChannelConf iguration the @schemeIdUri attribute is used to identify the audio 
channel configuration scheme employed. 

Multiple AudioChannelConf iguration elements may be present indicating that the Representation supports 
multiple audio channel configurations. For example, it may describe a Representation that includes MPEG Surround 
audio supporting stereo and multichannel. 

NOTE if the scheme or the value for this descriptor is not recognized the DASH client is expected to ignore the 
descriptor. 



8.6.3.8 Frame packing 

For the element FramePacking the ©scheme IdUri attribute is used to identify the frame packing configuration 
scheme employed. 

Multiple FramePacking elements may be present. If so, each element shall contain sufficient information to select or 
reject the described Representations or Sub-Representations. 

This specification defines a frame packing scheme in Annex C.3. 

NOTE: If the scheme or the value for all FramePacking elements is not recognized, the 3GP-DASH client is 
expected to ignore the described Representations or Sub-Representations. 
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8.7 Base URL Processing 
8.7.1 General 

The BaseURL element may be used to specify one or more common locations for Segments and other resources. 
Reference resolution as defined in 8.7.2 shall be applied to each URL in the MPD. Handling of multiple alternative base 
URLs is addressed in 8.7.3. 

The semantics of the attributes and elements for the Base URL are provided in Table 8-35. The XML syntax of the Base 
URL is provided in Table 8-36. 

Table 8-35 — Semantics of BaseURL element 



Element or Attribute Name 


Use 


Description 


BaseURL 




A URL that can be used as Base URL. The content of this 
element is a URI string as described in 8.7.2. 


©serviceLocation 





This attribute specifies a relationship between BaseURLs 
such that BaseURL elements with the same 
©serviceLocation value are likely to have their URLs 
resolve to services at a common network location, for 
example a common Content Delivery Network. 

If not present, no relationship to any other Base URL is 
known. 


Legend: 

For attributes: IVI=l\/landatory, 0=Optional, OD=Optional witli Default Value, CIVI=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 8-36 — XML-Syntax of BaseURL element 



< ! -- Base URL --> 

<xs : complexType .iame = "BaseURLType" > 
<xs : simpleContent> 

<xs : extension base="xs : anyURI" > 

<xs : attribute name="serviceLocation" type="xs : Btring"/> 
<xs : anyAt tribute namespace="##other" processContents="lax"/> 
</xs : extension> 
</xs : simpleContent> 
</xs : complexType> 



8.7.2 Reference resolution 

URLs at each level of the MPD are resolved according to RFC3986 with respect to the BaseURL element specified at 
that level of the document or the level above in the case of resolving base URLs themselves (the document 'base URI' as 
defined in RFC 3986 [17], Section 5.1 is considered to be the level above the MPD level). If only relative URLs are 
specified and the document base URI cannot be established according to RFC3986 [17] then the MPD should not be 
interpreted. URL resolution applies to all URLs found in MPD documents. 

In addition to the document level (the level above the MPD level), base URL information may be present on the 
following levels: 

• On MPD level in MPD . BaseURL element. For details refer to section 8.4. 1 . 

• On Period level in Period . BaseURL element. For details refer to section 8.4.2. 

• On Adaptation Set level in AdaptatlonSet . BaseURL element. For details refer to section 8.4.3.3. 
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• On Representation level in Representation . BaseURL. For details refer to section 8.4.3.4. 

8.7.3 Alternative base URLs 

If alternative base URLs are provided through the BaseURL element at any level, identical Segments shall be 
accessible at multiple locations. In the absence of other criteria, the DASH Client may use the first BaseURL element 
as 'base URI". The DASH Client may use base URLs provided in the BaseURL element as 'base URI' and may 
implement any suitable algorithm to determine which URLs it uses for requests. 



DASH - Usage of 3GPP File Format 



9.1 Introduction 

3GPP Dynamic Adaptive Streaming over HTTP uses many elements of fragmented 3GP files to define the Segment 
formats. This provides Segments according to the requirements defined in clause 8.4.4.1 and enables reuse of existing 
content, easy encoding and recording, etc. This clause introduces how to use the 3GPP file format as specified in 
TS 26.244 [4] for DASH Segment formats. 

9.2 Segment Types and Formats 

9.2.1 Introduction 

3GP-DASH defines a Segment format that is used in the delivery of media data over HTTP. A Segment shall contain 
one or more boxes in accordance with the boxed structure of the ISO-base media file format [11]. 

For 3GP-DASH the following apphes: 

In all cases for which a Representation contains more than one Media Segment, the following applies: 

- The Initialization Segment as defined in clause 9.2.2 shall be present. The Initialization Segment shall be 

available for the 3GP-DASH client before any Media Segment is processed within the Representation. 

- Media Segments shall not be self-initializing. The Media Segment format is defined in clause 9.2.3. 

In case a Representation contains only a single Media Segment, then either one of the following two options is 
used: 

1) An Initialization Segment as defined in clause 9.2.2 and one Media Segment as defined in clause 9.2.3. 

2) One Self-Initializing Media Segment as defined in clause 9.2.4. 

9.2.2 Initialization Segment 

The Initialization Segment is conformant with the 3GPP file format, adaptive streaming profile and shall carry '3gh9' 
as compatibility brand. 

The Initialization Segment consists of the 'ftyp' box, the 'moov' box, and optionally the 'pdin' box. The 'moov' box 
shall not contain any samples (i.e. the entry_count in the 'stts', 'stsc', and 'stco' boxes shall be set to 0) and is then 
very small in size. This reduces the start-up time significantly as the Initialization Segment needs to be downloaded 
before any Media Segment can be processed. 

The 'mvex' box shall be contained in the 'moov' box to indicate that the client has to expect movie fragments. The 
'mvex' box also sets default values for the tracks and samples of the following movie fragments. 

The Initialization Segment provides the client with the metadata that describes the media content. The client uses the 
information in the 'moov' box to identify the available media components and their characteristics. 

The Initialization Segment shall not contain any 'moof or 'mdat' boxes. 
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9.2.3 Media Segment 

9.2.3.1 General 

A Media Segment contains and encapsulates media streams that are either described within this Media Segment or 
described by the InitiaHzation Segment of this Representation or both. 

In addition, a Media Segment 

1 . shall contain a number of complete access units. 

2. should contain at least one Stream access point (SAP) for each contained media stream. 

3. should provide information on how to access the Media Presentation within this Segment, e.g. exact 
presentation time and an index. There is no requirement that a Media Segment starts with a SAP, but it is 
possible to signal in the MPD that all media streams in a Segments within a Representation start with a 
SAP. 

4. if it is the first Media Segment in the Representation, it shall contain only media streams that start with a 
SAP of type 1 or 2. 

5. shall contain sufficient information to time-accurately present each contained media component in the 
Representation without accessing any previous Media Segment in this Representation provided that the 
Media Segment contains a SAP for each media stream. The time-accuracy enables a client to seamlessly 
switch Representations and jointly present multiple Representations. 

6. may be divided into Subsegments by a Segment Index as defined in 9.2.3.2. 

7. shall specify all Media Presentation times relative to the start of the Period and compensated with the value 
of the ©presentationTimeOf f set. The presentation time in Media Segments shall be accurate to 
ensure accurate alignment of all Representations in one Period. For more details refer to 9.4.1.1. 

9.2.3.2 Subsegments and Segment Index 

Media Segments may contain multiple Subsegments. Each Subsegment shall contain a number of complete access units. 
There may also be media-format-specific restrictions on Subsegment boundaries. If a Segment is divided into multiple 
Subsegments this division is described by a compact Segment index, which provides the presentation time range in the 
Representation and corresponding byte range in the Segment occupied by each Subsegment for one or more media 
streams. Clients may download this index in advance and then issue requests for individual Subsegments. 

In addition, the Segment Index provides timing and stream access information. This includes the earliest presentation 
time of access units in each Subsegment of an indexed media stream and the presentation time of the first SAP, if 
present. 

If a Segment Index is present for at least one media stream, then for any media stream for which no Segment Index is 
present, referred to as non-indexed stream, the following applies: 

• every access unit of the non-indexed streams shall be a SAP of type 1 . 

• for each Subsegment, every non-indexed stream must contain exactly one access unit within the Subsegment 

with presentation time less than or equal to the earliest presentation time of the Subsegment 

When multiple media streams are indexed in a single index file, the corresponding Segment Index for different 
media streams should index the same number of Subsegments. 

If no Segment Index is provided for a Media Segment, then the Media Segment constitutes one Subsegment. 

A Subsegment may itself be further subdivided using further Segment Index boxes. If a Subsegment only contains 
media data but no Segment Index, it is referred to as Media Subsegment. 

1) The Segment Index may contain additional Subsegment indexing information for accessing different levels of 
Subsegments in a Media Subsegment. For more details refer to clause 9.2.3.3. 
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A generic mechanism for indexing of Media Segments is provided by the Segment Index ("sidx") box in TS26.244 
[4]. In this case, 

• the earliest presentation time of a Subsegment is documented in the earliest_presentation_time 

field. 

• the byte range is document by the f irst_of f set field and the ref erence_size field. If two 

Segment Index boxes document the same byte range, then the value of their f irst_of f set field and 
their ref erence_size field shall be identical. 

9.2.3.3 Subsegment Index 

Media Subsegments may be indexed further to enable accessing different levels of Subsegments in a Media 
Subsegment. This Subsegment Index may also be provided in separate Index Segments together with the Segment 
Index. 

A generic syntax and semantic for Subsegment indexing is provided by the Subsegment Index ("ssix") in Annex G.3. 

9.2.3.4 3GP-DASH Media Segment Format 

A Media Segment conforming to the Media Segment Format for 3GP DASH shall carry "3gmA" as a compatible brand 
and is defined as follows: 

Each Media Segment may contain an "styp" box. 

If the Media Segment is the last media Segment in the Representation, the ' styp ' box may carry "Imsg" as a 
compatible brand. 

Each Media Segment shall contain one or more whole self-contained movie fragments. A whole, self-contained 
movie fragment is a movie fragment ("moof ") box and a media data ("mdat") box that contains all the media 
samples that do not use external data references referenced by the track runs in the movie fragment box. 

Each "moof" box shall contain at least one track fragment. 

The "moof" boxes shall use movie-fragment relative addressing and the flag "default -base -is -moof" 
shall also be set. Absolute byte-offsets shall not be used. In a movie fragment, the durations by which each track 
extends should be as close to equal as practical. In particular, as movie fragments are accumulated, the track 
durations should remain close to each other and there should be no 'drift'. 

Each "traf " box shall contain a "tf dt" box. 

The track fragment adjustment box "tf ad" as defined in 3GPP TS26.244 [4] may also be present to maintain 
compatibility with earlier releases of this specification; care should be taken that the alignment established by the 
"tf dt" and the time-shifting implied by the "tf ad" not be both applied, which would result in a double 
correction. 

Each Media Segment may contain one or more "sidx" boxes. If present, the first "sidx" box shall be placed 
before any "moof" box and the subsegment documented by the first Segment Index ("sidx") box shall be the 
entire Segment, i.e. the entire Segment shall be document by the first Segment Index ("sidx") box. 

A media Segment may contain a Subsegment Index box ("ssix"). If present it shall follow immediately after 
the "sidx" box that documents the same subsegment. This immediately preceding "sidx" shall only index 
subsegments. 

Further rules on media Segments in combination with certain MFD attributes are provided in clause 9.4. 



9.2.4 Self-Initializing Media Segment 



A Self-Initializing Media Segment conforms to the concatenation of an Initialization Segment as defined in 9.2.2 and a 
Media Segment as defined in 9.2.3. 
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9.2.5 Media Stream and Segment Properties 

9.2.5.1 Media stream access points 

To be able to access a Representation, each of the media streams that are contained in the Representation requires 
Media Stream Access Points (S APs). Annex G.6 defines different types of S APs that provide a relationship between the 
position where a stream can be accessed, a SAP, relative to the start of a Segment or Subsegment, its presentation time 
and the presentation times and position of other access unit in the stream. 

A SAP is a position in a Representation that enables playback of a media stream to be started using only the information 
contained in Representation data starting from that position onwards (preceded by initializing data in the Initialization 
Segment, if any). 

For each SAP the properties, Isap, Tsap, Isau^ Tqec^ Tept, and Tpxp are identified and defined in Annex G.6.2. 

In particular, Tsap is defined to be earliest presentation time of any access unit of the media stream such that all access 
units of the media stream with presentation time greater than or equal to Tsap can be correctly decoded using data in the 
Representation starting at byte position Isap and no data before Isap- 

9.2.5.2 Non-overlapping Segments and Subsegments 

Segments and Subsegments represent units for which the client has an exact map on how to access and download the 
unit using HTTP GET or HTTP partial GET methods. 

Segments (respectively Subsegments) are typically generated by segmenting encoded media streams into appropriate 
units. If the generation of Segments (respectively Subsegments) adheres to certain rules, then the sequential decoding 
and presentation of Media Segments (respectively Subsegments) results in a correct presentation of all contained media 
streams. To define such rules the notion of 'non-overlapping' Segments (respectively Subsegments) is defined as 
follows. 

Let 

• T^S,i) be the earliest presentation time of any access unit in stream / of a Segment or Subsegment S, 

• TiiS,i) be the latest presentation time of any access unit in stream / of a Segment or Subsegment S. 

Then two Segments (respectively Subsegments), A and B, which may or may not be of different Representations, are 
non-overlapping if Ti(A,i) < TE(B,i) for all media streams / in A and B or if TJBJ) < TE(A,i) for all streams / in A and B 
where / refers to the same media component. 

The property of 'non-overlapping' Segments (respectively Subsegments) is used to define the terms Segment alignment 
and Subsegment alignment. 

9.2.5.3 Bitstream concatenation 

A sequence of Segments (respectively Subsegments) is a "conforming Segment (respectively Subsegment) sequence" if 
the concatenation of all Segments (respectively Subsegments) in the sequence of Segments (respectively Subsegments) 
results in a bitstream that conforms to the media formats in use (including container and codecs). 

NOTE This implies that a player conforming to the media format can play the resulting bitstream. 
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9.3 Usage on Server and Client 



3GP-DASH uses 3GP files according to the 3GP Adaptive-Streaming profile as specified in TS 26.244 [4]. Content 
may be prepared as 3GP files according to the 3GP Adaptive-Streaming profile. Initialization Segments and Media 
Segments may be generated by segmenting such 3GP files. Segment Index "sidx" boxes may be pre-contained in 3GP 
files or may be generated during the segmentation process. Clients may store a concatenation of a received Initialization 
Segment and a sequence of Media Segments from the same Representation to create a compliant 3GP file according to 
the Adaptive Streaming profile without accessing any media samples. 

NOTE: As specified in TS 26.244, the MPD may be Hnked or embedded in the "meta" box of the "moov" box. 
This enables clients to access the MPD from a 3GP file that was made available from other means than 
3GP-DASH (e.g. progressive download). 

9.4 Segment Properties with IVIPD constraints 

9.4.1 General 
9.4.1.1 Introduction 

The content, especially the Segments across Representations at the same media time may have been prepared in a joint 
or at least coordinated manner. To expose these properties to the client, certain flags in the MPD can be set to true to 
indicate such coordinated content preparation. Clients consuming 3GP-DASH formatted media presentations may 
benefit from properly authored content when switching between or presenting Representations. 



9.4.1 .2 Media Presentation Timeline 

One of the key features in DASH is that encoded versions of different media components share a common timeline. The 
presentation time of access unit within the media content is mapped to the global common presentation timeline for 
synchronization of different media components and to enable seamless switching of different coded versions of the 
same media components. 

The presentation times within each Period are relative to the PeriodStart time of the Period minus the value of the 
@presentationTimeOf f set. To, of the containing Representation. This means for an access unit with a 
presentation time Tp signalled in the media stream, the Media Presentation time relative to the PeriodStart is rM=7p- 
7^0- 

Media Segments should not contain any presentation time Tp that is smaller than the value of the 

©presentationTimeOf f set, Tq. However, if this is the case, then presentation of the Media Segment is expected 
to only take place for presentation times greater than or equal to Tq. 

The MPD start times as defined in 8.4.4.3.3 shall provide an approximation of the Media Presentation time T^ within 
the Period. Specifically, the MPD start time shall be drift-free relative to the presentation time Tp signalled in the media 
stream, i.e. the accuracy of the offset of the MPD start time relative to the presentation time does not depend on the 
position of the Segment in the Representation. 

NOTE At the start of a new Period, the playout procedure of the media content components may need to be 

adjusted at the end of the preceding Period to match the PeriodStart time of the new Period as there may 
be small overlaps or gaps with a Representation at the end of the preceding Period. Overlaps (respectively 
gaps) may result from Media Segments with actual presentation duration of the media stream longer 
(respectively shorter) than indicated by the Period duration. Also in the beginning of a Period if the 
earliest presentation time Tp of any access unit of a Representation is larger than then the playout 
procedures need to be adjusted accordingly. 

For the case when MPD@type is "dynamic" and the attribute MPD@suggestedPresentationDelay is present, 
then the sum of value of the the MPD@availibilityStartTime, the PeriodStart value, the presentation time 
within the Period of an access unit, Tm, and the value of the the attribute MPD@suggestedPresentationDelay 
provides a mapping of the presentation time of each access unit to the wall-clock time, for example to express 
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synchronization with a content internal time or for other reasons to enable synchronization of presentation to the wall- 
clock. 

For the Segment formats as defined in section 9.2.3.4, the presentation time Tp internal in the media that maps the media 
to the Media Presentation timeline shall be relative to the movie timeline, i.e. they are composition times after the 
application of any edit list for the track. 

It is recommended that the @timescale attribute in the MPD matches the timescale field in the Media Header 
Box of a present track. If the Segment Index ('sidx') box is present, then it is further recommended that the track for 
which the Segment Index ('sidx') box that appears first in the Media Segment is the track defining the value of the 
©timescale attribute. 

9.4.1.3 Segment Index 

If a Segment Index is present in a Media Segment of one Representation within an Adaptation Set, then the following 
shall hold: 

• the order of Segment Index boxes for multiple media streams induces an ordering on the media content 

components equal to the order in which a Segment Index box for a media stream for each component first 
appears. This ordering shall be the same for all Segments of all Representations of an Adaptation Set. As a 
consequence, if there is a Segment Index for a media content component in one Segment there shall be a 
Segment Index for that media component in all Segments in this Adaptation Set. 

• non-indexed media streams in all Representations of an Adaptation Set shall have the same access unit 

duration. 

9.4.2 Segment Alignment 

No additional requirements beyond those stated in section 8.4.3.3 are defined. 



9.4.3 Bitstream Switching 



If the ©bitstreamSwitching is set to "true" for a set of Representations within an Adaptation Set, the conditions 
stated in section 8.4.3.3 shall be satisfied. 

As a consequence of ©bitstreamSwitching being set to "true", the following conditions are satisfied: 

• The track IDs for the same media content component are identical for each Representation in each 

Adaptation Set. 

• The conditions required for setting the ©segmentAlignment attribute to a value other than 'false' for 

the Adaptation Set are fulfilled. 

• The conditions required for setting (i) the ©startWithSAP attribute to 2 for the Adaptation Set, or (ii) the 

conditions required for all Representations within the Adaptation Set to share the same value of 
©mediaStreamStructureld and setting the ©startWithSAP attribute to 3 for the Adaptation Set, 
are fulfilled. 



9.4.4 Sub-Representation 



If a SubRepresentatlon element is present in a Representation in the MPD and the 
SubRepresentation©level is present, then the media Segments in this Representation shall include a Segment 
Index ("sidx") box and the Initialization Segment shall contain the Level Assignment ("leva") box. 

The attribute ©level specifies the level to which the described Sub-Representation is associated in the Subsegment 
Index. Level n corresponds to the n-th level in the Subsegment Index. The information in Representation, Sub- 
Representation and in the Level Assignment ("leva") box contains information on the assignment of media data to 
levels. 

Media data should be ordered such that each higher value for ©level provides an enhancement compared to any lower 
value of ©level. 
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For temporal level assignment, the sample grouping ' tele ' as defined in clause G.4, shall be used. 

10 QoE for Progressive Download and DASH 

10.1 General 

A progressive download or 3GP-DASH client supporting Quality of Experience (QoE) shall report QoE metrics 
according to the QoE configuration. QoE reporting is optional, but if a 3GP-DASH client reports DASH metrics, it shall 
report all requested metrics. 

The quality metrics are defined in subclause 10.2. 

The quality metrics applicable for progressive download are specified in section 10.3. In this case the activation and 
configuration of QoE reporting framework is achieved by a corresponding OMA DM QoE Management Object as 
specified in Annex F. 

The quality metrics for DASH are specified in section 10.4. In this case, QoE reporting may be triggered using the MPD 
( i.e. when the Metrics element is present in the MPD) or using OMA DM QoE Management Object as specified in 
Annex F. When QoE reporting is triggered via the MPD or OMA DM QoE Management Object, the 3GP-DASH client 
is expected to collect quality metrics according to the QoE configuration. When using the MPD, the Quality Reporting 
scheme as defined in section 10.5 may be used. 

The quality metric reporting protocol is defined in subclause 10.6. This protocol shall be used when QoE reporting is 
triggered via the MPD or OMA DM QoE Management Object. 

1 0.2 QoE Metric Definitions 

10.2.1 Introduction 

This section provides the general QoE metric definitions and measurement framework. 

The semantics are defined using an abstract syntax. Section 10.6 provides a mapping to an XML schema. Items in this 
abstract syntax have one of the following primitive types (Integer, Real, Boolean, Enum, String) or one of the 
following compound types: 

Ob j ects: an unordered sequence of (key, value) pairs, where the key always has string type and is unique 
within the sequence. 

List: a ordered list of items. 

Set: an unordered set of items. 

Additionally, there are two kinds of timestamp defined, i.e. real time (wall-clock time) and media time. 

1 0.2.2 HTTP Request/Response Transactions 

Table 25 contains the metric defining the List of HTTP Request/Response Transactions. 
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Table 25: List of HTTP Request/Response Transactions 



— Key 


Type 


Description 


HttpList 


List 


List of HTTP request/response transactions 




Entry 


Object 


An entry for a single HTTP request/response 






tepid 


Integer 


Identifier of the TCP connection on which the HTTP request was sent. 






type 


Enum 


This is an optional parameter and should not be included in HTTP 

request/response transactions for progressive download. 

The type of the request: 

-MPD 

-MPD delta file 

- XLink expansion 

- Initialization Segment 

- Index Segment 

- Media Segment 






url 


String 


The original URL (before any redirects or failures) 






actualurl 


String 


The actual URL requested, if different from above 






range 


String 


The contents of the byte-range-spec part of the HTTP Range 
header. 






trequest 


Real Time 


The real time at which the request was sent. 






tresponse 


Real Time 


The real time at which the first byte of the response was received. 






responsecode 


Integer 


The HTTP response code. 






interval 


Integer 


The duration of the throughput trace intervals (ms), for successful 
requests only. 






Trace 


List 


Throughput trace, for successful requests only. 








Entry 


Object 


A single throughput measurement entry. 










s 


Real Time 


Measurement period start. 










d 


Integer 


Measurement period duration (ms). 










b 


List 


List of integers counting the bytes received in each trace interval within 
the measurement period. 



NOTE: 

1) Information additional to that specified in the type may be returned, for example if a client makes a 
request for a initialization information from a self-initializing Media Segment then index information may 
also be received. 

2) All entries for a given object will have the same url and range and so can easily be correlated. If there 
were redirects or failures there will be one entry for each redirect/failure. The redirect-to URL or 
alternative URL (where multiple have been provided in the MPD) will appear as the actualurl of the 
next entry with the same url value. 

3) The periods reported in Entry should be those periods where the client was actively reading from the 
TCP connections (i.e. they should not include periods where the TCP connection is idle due to zero 
receive window). 

The end of the last measurement period reported in the Trace shall be the time at which the last byte of the response 
was received. 

The interval and Trace shall be absent for redirect and failure records. 

The key HttpList (n, type) where n is a positive integer is defined for an HttpList with an interval of n ms 
and type is one of MPD, MPDDeltaFile, XLinkExpansion, InitializationSegment, MediaSegment, or IndexSegment. If 
type is not present, all HTTP transactions are requested to be collected. If type is present, it specifies that the HTTP 
transactions concerning a resource equal to type are requested to be collected. Multiple keys HttpList (n, type) 
with different values of n and type may be present for a single ©metrics attribute value. 

An HTTP transaction that is not finished within a QoE metric collection period shall not be included in the reported 
metrics. 

1 0.2.3 Representation Switch Events 

Table 26 defines the metric to report a list of representation switch events. 
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Table 26: List of Representation Switch Events 



Key 


Type 


Description 


RepSwitchList 


List 


List of representation switch events (a switch event is 
the time at which the first HTTP request for a new 
representation, that is later presented, is sent) 




Entry 


Object 


A representation switch event. 






t 


Real Time 


Time of the switch event. 






mt 


Media Time 


The media time of the earliest media sample (out of 
all media components) played out from the 'to' 
representation. 






to 


String 


Value of Representation@id identifying the switch-to 
representation. 






Lto 


Integer 


If present, value of SubRepresentation@level 
within Representation identifying the switch-to level 
of the Representation 



10.2.4 Average Throughput 

This metric in Table 27 indicates the average throughput that is observed by the client during the measurement interval. 

Table 27: Average Throughput 



Key 


Type 


Description 


AvgThroughput 


Object 


Average throughput that is observed by the client 
during the measurement interval 




numbytes 


Integer 


The total number of the content bytes, i.e. the total 
number of bytes in the body of the HTTP responses, 
received during the measurement interval. 




activitytime 


Integer 


The activity time during the measurement interval In 
milliseconds. The activity time during the 
measurement interval is the time during which at 
least one GET request is still not completed (i.e. 
excluding inactivity time during the measurement 
interval). 




t 


Real Time 


The real time of the start of the measurement interval 




duration 


Integer 


The time in milliseconds of the measurement interval 




accessbearer 


String 


Access bearer for the TCP connection for which the 
average throughput is reported 




inactivity type 


Enum 


Type of the inactivity, if known and consistent 

throughout the reporting period: 

User request (e.g. pause) 

Client measure to control the buffer 

Error case 



If the client requests the media Segments from the server separately over multiple non-competing parallel TCP 
connections established over separate access network bearers named as accessbearer, then the average throughput 
values should be reported as a list of events with average throughput for each access network and associated access 
network bearer information reported separately, following the same guidelines as described above. 



10.2.5 Initial Playout Delay 



This metric in Table 28 signals the initial playout delay at the start of the streaming of the presentation. 
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Table 28: Initial Playout Delay 



Key 


Type 


Description 


InitialPlayoutDelay 


Integer 


The initial playout delay is measured as the time in 
milliseconds from the fetch of the first media Segment 
(or sub-segment) and the time at which media is 
retrieved from the client buffer. 



10.2.6 Buffer Level 

Table 29 defines the metric to report a list of buffer level status events. 

Table 29: List of Buffer Level 



Key 


Type 


Description 


Buf f erLevel 


List 


List of buffer occupancy level measurements during 
playout at normal speed. 




Entry 


Object 


One buffer level measurement. 






t 


Real Time 


Time of the measurement of the buffer level. 






level 


Integer 


Level of the buffer in milliseconds. Indicates the playout 
duration for which media data of all active media 
components is available starting from the current playout 
time. 



The key is Buf f erLevel (n) , where « is a positive integer is defined to refer to the metric in which the buffer level 
is recorded every n ms. 

10.2.7 Play List 

Decoded samples are generally rendered in presentation time sequence, each at or close to its specified presentation 
time. A compact representation of the information flow can thus be constructed from a list of time periods during which 
samples of a single representation were continuously rendered, such that each was presented at its specified presentation 
time to some specific level of accuracy (e.g. +/-10 ms). 

Such a sequence of periods of continuous delivery is started by a user action that requests playout to begin at a specified 
media time (this could be a 'play', 'seek' or 'resume' action) and continues until playout stops either due to a user action, 
the end of the content, or a permanent failure. 

Table 30 defines the play list event metric. 
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Table 30: Play List 



Key 


Type 


Description 


PlayList 


List 


A list of playback periods. A playback period is the time 
interval between a user action and whichever occurs 
soonest of the next user action, the end of playback or a 
failure that stops playback. 




Entry 


Object 


A record of a single playback period. 






start 


Real Time 


Timestamp of the user action that starts the playback 
period. 






mstart 


Media Time 


The presentation time at which playout was requested by 
the user action. 






starttype 


Enum 


Type of user action which triggered playout 

- New playout request (e.g. initial playout or seeking) 

- Resume from pause 

- Other user request (e.g. user-requested quality change) 

- Start of a metrics collection period (hence earlier entries in 
the play list not collected) 






Trace 


List 


List of periods of continuous rendering of decoded samples. 








Traceentry 


Objects 


Single entry in the list. 










represent at ionid 


String 


The value of Representation@id from which the samples 
were taken. 

This is an optional parameter and should not be reported in 
case of progressive download. 










subreplevel 


Integer 


If not present, this metric concerns the Representation as a 
whole. If present, subreplevel indicates the greatest value of 
any SubRepresentation@level being rendered. 
This is an optional parameter and should not be reported in 
case of progressive download. 










start 


Real Time 


The time at which the first sample was rendered. 










sstart 


Media Time 


The presentation time of the first sample rendered. 










duration 


Integer 


The duration of the continuously presented samples (which 
is the same in real time and media time). 'Continuously 
presented' means that the media clock continued to 
advance at the playout speed throughout the interval. 










playbackspeed 


Real 


The playback speed relative to normal playback speed 
(i.e. normal forward playbackspeed is 1.0). 










stopreason 


Enum 


The reason why continuous presentation of this 
representation was stopped. Either: 

- representation switch (not relevant in case of progressive 
download) 

- rebuffering 

- user request 

- switch from unicast to broadcast 

- switch from broadcast to unicast 

- end of period 

- end of content 

- end of a metrics collection period 

- failure 



NOTE: The trace may include entries for different representations that overlap in time, because multiple 

representations are being rendered simultaneously, for example one audio and one video representation. 

10.2.8 MPD Information 

This metric can be used to report Representation information from the MPD, so that reporting servers without direct 
access to the MPD can understand the used media characteristics. 

The metric is reported whenever the client sends any other quality metrics report containing references to a 
Representation which MPD information has still not been reported. 

Table 3 1 defines the MPD information for quality reporting. 
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Table 31 : MPD Information for Quality Reporting 



Key 


Type 


Description 


MPDInf ormation 


Object 






representationid 


String 


Value of Representationoid for the representation 
addressed by the QoE metrics report. 




subreplevel 


Integer 


If present, value of SubRepresentation@level for the 
subrepresentation addressed by the QoE metrics report. If 
not present, the QoE metrics report concerns the 
representation as a whole. 




Mpdinf o 


Represent at ionType 


Provides the IVIPD information for the representation or 
subrepresentation identified by representationid and 
subreplevel, if present. The following attributes and 
elements shall be present within mpdinfo if they are 
present for the identified representation or 
subrepresentation and their values shall be identical to 
those presented in the IVIPD: ©bandwidth, 
@qualityRanking, @width, (©height, @mimeType, and 
@codecs. 



1 0.3 Quality Metrics for Progressive Download 

The following metrics shall be supported by progressive download clients supporting the QoE reporting feature: 
List of HTTP Request/Response Transactions (Section 10.2.2), 

- Average Throughput (Section 10.2.4), 

- Initial Playout Delay (Section 10.2.5), 

- Buffer Level (Section 10.2.6), 

- Play List (Section 10.2.7). 

1 0.4 Quality IVIetrics for DASH 

The following metrics shall be supported by 3GP-DASH clients supporting the QoE reporting feature: 
List of HTTP Request/Response Transactions (Section 10.2.2), 
List of of Representation Switch Events (Section 10.2.3), 

- Average Throughput (Section 10.2.4), 

- Initial Playout Delay (Section 10.2.5), 

- Buffer Level (Section 10.2.6), 

- Play List (Section 10.2.7), and 

- MPD Information (Section 10.2.8). 

The ometrics attribute contains a list of quality metric keys listing all metrics that the DASH shall collect and report. 

The semantics of the attributes within the Metrics element are provided in Table 32. The XML-syntax of a Metrics 
element is provided in Table 33. 

Table 32: Semantics of Metrics element 



Element or Attribute Name 


Use 


Description 


Metrics 




DASH metric element 


©metrics 


M 


This attribute lists all quality metrics (as a list of quality 
metric keys as defined in section 1 0.2, separated by a 
whitespace) that the client shall report. 
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Element or Attribute Name 


Use 


Description 






Certain keys allow specifying a measurement interval or 
period over which a single value of the metric is derived and 
potentially also other parameters controlling the collection of 
the metrics. The parameters, if any, are included in 
parenthesis after the key and their semantics are specified in 
clause 10.2 with the metric definition itself. 


Range 


0..N 


When specified, it indicates the time period during which 
quality metric collection is requested. When not present, 
quality metric collection is requested for the whole duration 
of the content. 


@starttime 





When specified, it indicates the start time of the quality 
metric collection operation. When not present, quality metric 
collection is requested from the beginning of content 
consumption. For services with MPDotype ' Live ' , the start 
time of quality metric collection can be obtained in wallclock 
time by adding the value of this attribute indicated in media 
time to the value of the MPDOavailabilityStartTime 
attribute. For services with MPDotype ' onDemand ■ , the 
start time is indicated in media time and is relative to the 
PeriodStart \.\me of the first period in this IVIPD. 


©duration 





When specified, it indicates the duration of the quality metric 
collection interval. The value of this attribute is expressed in 
media time. 


Reporting 


1...N 


Descriptor that provides information about the requested 
Quality Reporting method and formats. See clause 10.6 for 
the 3GP-DASH quality reporting schemes. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @. 



Table 33: XML-Syntax of Metrics element 



<!-- QoE Collection and Reporting --> 
<xs : complexType nam- "MetricsType" > 
<xs : sequence> 

<xs:element iame="Reporting" type="DescriptorType" maxOccurs="unbounded"/> 
<xs:element name^"Range" ■e="RangeType" minOccurs="0" maxOccurs= "unbounded" /> 
<xs : any namespac- "##other" orocessContents="lax" minOccurs="0" maxOccur; "unbounded"/> 
</xs : sequence> 

<xs : attribute name= "metrics" typ- "xs: string" use="required"/> 
<xs : anyAttribute namespace="##other" p_ ocessContents = "lax"/> 
</xs : complexType > 

<xs : complexType name "RangeType"> 

<xs : sequence> 

<xs : any namespaC' "##other" orocessContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 

</xs : sequence> 

<xs :attribute name="startTime" type="xs lunsignedlnt" "optional"/> 

<xs :attribute name= "duration" t ■ •' >i-"xs iduration" ^.^^ "required"/> 

<xs : anyAttribute namespace="##other" processContentS; "lax"/> 
</xs : complexType> 



1 0.5 Quality Reporting Scheme for DASH 

This section specifies a 3GP-DASH quality reporting scheme. 

The quality reporting scheme is signaled using in the Reporting element in the Metrics element. The URN to be 
used for the Reporting@schemeIdUri shall be "urn : 3GPP : ns : PSS : DASH : QMIO". 

The reporting scheme shall use the quality reporting protocol defined in section 10.6. 

The semantics and XML syntax of the scheme information for the 3GP-DASH quality reporting scheme are specified in 
Table 34 and Table 35, respectively. 
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Table 34: Semantics of Quality Reporting Scheme Information 



Element or Attribute Name 


Use 


Description 


@apn 





This attribute gives the access point that should be used for 
sending the QoE reports. 


®f ormat 





This field gives the requested format for the reports. 
Possible formats are: 'uncompressed' and 'gzip'. 


©samplepercentage 





Percentage of the clients that should report QoE. The client 
uses a random number generator with the given percentage 
to find out if the client should report or not. 


©report ingserver 


M 


The reporting server URL to which the reports will be sent. 


©report inginterval 





Indicates the time(s) reports should be sent. If not present, 
then the client should send a report after the streaming 
session has ended. If present, @reportinginterval=n 
indicates that the client should send a report every n-th 
second provided that new metrics information has become 
available since the previous report. 


Legend: 

For attributes: M=Mandatory, 0=Optional, OD=Optional with Default Value, CM=Conditionally Mandatory. 
For elements: <minOccurs>...<maxOccurs> (N=unbounded) 
Elements are bold; attributes are non-bold and preceded with an @ 



Table 35: Syntax of Quality Reporting Scheme Information 



<?xml version="l . 0"?> 

<xs : schema targetNamespace="urn:3GPP :ns :PSS lAdaptiveHTTPSt reaming: 20 9 :qm" 

at tributeFormDefault= "ungual if led" 

elementFormDefault= "qualified" 

xmlns :xs = "http : //www.wS . org/2 00 1/XMLSchema" 

xmlns :xlink^"http : //www.w3 .org/1999/xlink" 

xmlns="urn: 3GPP:ns :PSS lAdaptiveHTTPStreaming: 20 9 :qm"> 

<xs : annotation> 

<xs : appinfo>3GPP DASH Quality Reporting</xs : appinfo> 

<xs : documentation xml : lang="en" > 

This Schema defines the quality reporting scheme information for 3GPP DASH. 

</xs : documentation> 
</xs : annotation> 



<xs : element 



" ThreeGPQual i tyReport ing " 



"SimpleQualityReportingType"/; 



<xs : complexType .._.::i<_ "SimpleQualityReportingType" > 

<xs :attribute name="apn" type="xs : string" use="optional"/> 

<xs : attribute name="format" tvr)p = "FormatType" use = "optional"/> 

<xs : attribute name="samplePercentage" type="xs idouble" use="optional"/> 

<xs : attribute name="reportingServer" type = "xs : anyURI" i'=i-"required"/> 

<xs : attribute name="reportingInterval" type="xs lunsignedlnt" "optional"/> 

</xs : complexType> 

<xs : simpleType _ ^ "FormatType" > 

<xs : restriction base="xs : string" > 

<xs : enumeration valu^^ "uncompressed" /> 
<xs : enumeration > . !, "gzip" /> 
</xs : restriction> 
</xs : simpleType> 

</xs : schema> 



10.6 Quality Reporting Protocol 
10.6.1 General 

The quality reporting protocol consists of: 

The report format defined in section 10.6.2 
The reporting protocol defined in section 10.6.3 
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10.6.2 Report Format 

The QoE report is formatted as an XML document that complies with the following XML schema: 



;?xml version="l . 0"?> ^^^^^ 

<xs:schema xmlns:xs "http://www.w3.org/2001/XMLSchema" 

"urn: 3gpp : metadata : 2 011 :HSD: receptionreport" 
"urn: 3gpp : metadata : 2011 :HSD: receptionreport" 



"qualified" > 



<xs : element 



"ReceptionReport" 



"ReceptionReportType" /; 



Baxc 



nax( 



<xs : complexType 
<xs : choice> 

<xs : element 
" unbounded " / > 

<xs : any 
" unbounded " / > 
</xs : choice> 
<xs : attribute 
<xs : attribute name 
</xs : complexType> 

<xs : complexType " 
<xs : sequence> 
<xs : element 
" unbounded " / > 

<xs : any 
" unbounded " / > 
</xs : sequence> 
<xs : attribute 
<xs : attribute ::■:•':■: 
<xs : attribute name 
<xs : anyAttribute 
</xs : complexType> 
<xs : complexType 
<xs : choice> 

<xs : element 
<xs : element 
<xs : element 
<xs : element 
<xs : element 
<xs : element 
<xs : element 
</xs : choice> 
<xs : anyAttribute 
</xs : complexType> 



"ReceptionReportType" > 



"QoeReport" 


"QoeReportType" "0" 


"##other" 


"skip" "0" 


"contentURI" 


"xs:anyURI" "required" /> 


: = "clientID" 


"xs: string" "optional" /> 


QoeReportType" > 




"QoeMetric" 


"QoeMetricType" "1" 


"##other" 


"skip" "0" 



"periodic" 
"reportTime" 
■- " reportPeriod" 



QoeMetricType" 



"XS: string" " required" /> 
"xs :dateTime" "required"/> 

"xs :unsignedlnt" " required" /> 
"skip"/> 



"HttpList" "HttpListType"/> 
"RepSwitchList" "RepSwitchListType"/> 
"AvgThroughput" "AvgThroughputType" 
"InitialPlayoutDelay" "xs :unsignedlnt'' 
"Buf ferLevel" "Buf f erLevelType"/> 
"PlayList" "PlayListType"/> 
"MPDInformation" "MpdInformationType" 



" unbounded " / > 



" unbounded " / > 



proc^i 



'skip"/; 



<xs : complexType 
<xs : choice> 

<xs : element 
</xs : choice> 
<xs : anyAttribute 

</xs : complexType> 

<xs : complexType 
<xs : choice> 

<xs : element 
</xs : choice> 
<xs : attribute .i.i,. 
<xs : attribute name= 
name = 
name = 



"HttpListType" > 

"HttpList Entry" 

"skip"/; 



"HttpListEntryType" maxOcc 



"unbounded"/; 



"HttpListEntryType" > 

" Trace " " HttpThroughputTraceType " 



" unbounded " / > 



<xs : attribute 
<xs : attribute 
<xs : attribute name 
<xs : attribute name 



"type" 
"url" 

"actualUrl" 
"range" 
"t request" 
<xs : attribute name="tresponse" 
<xs : attribute n3rre="responsecode 
<xs : attribute "interval" 
<xs : anyAttribute 
</xs : complexType> 



!"tcpid" "xs :unsignedlnt" "optional"/> 

"ExtensibleHttpEntryResourceType" "optional"/; 
XS: string" " required" /> 

"XS: string" "optional"/> 
"XS: string" "optional"/> 

"xs :dateTime" "required"/> 
"xs :dateTime" "required"/> 

"xs :unsignedlnt" "optional" /> 
"xs :unsignedlnt" "optional" /> 
skip"/> 



<xs : simpleType "HttpEntryResourceType"> 
<xs : restriction "xs: string" > 
<xs : enumeration "MPD"/> 
<xs : enumeration "MPDDeltaFile"/> 
<xs : enumeration "XLinkExpansion"/> 
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<XB : enumeration 
<xs : enumeration 
<xs : enumeration 
</xs : restriction> 
</xs : simpleType> 



"InitializationSegment"/> 
" IndexSegment " / > 
"MediaSegment " / > 



<xs : simpleType "StringPatternType" > 
<xs : restriction "xs: string" > 

<xs: pattern "x:\S.*"/> 
</xs : restriction> 

</xs : simpleType> 



<xs : simpleType 

<xs : union 
</xs : simpleType> 



"ExtensibleHttpEntryResourceType" > 

"HttpEntryResourceType StringPatternType" /> 



<xs : complexType "HttpThroughputTraceType" > 
<xs : attribute "s" "xs idateTime" 
<xs : attribute 110111== "d" "xs lunsignedlnt" 
<xs : attribute r3rT.e="b" "xs lunsignedlnt" 
<xs : anyAttribute "skip"/> 

</xs : complexType> 



" required" /> 
" required" /> 
" required" /> 



<xs : complexType 

<xs : choice> 

<xs : element 

</xs : choice> 

<xs : anyAttribute 
</xs : complexType> 

<xs : complexType 

<xs : attribute "to" 
<xs : attribute name-"mt" 
<xs : attribute r-irr,= -"t" 
<xs : anyAttribute 

</xs : complexType> 



'RepSwitchListType" > 
" RepSwitchEvent " 

"skip"/; 

"RepSwitchEventType" > 



" RepSwitchEventType " 



"unbounded"/; 



"XS: string" " required" /> 
"xs lunsignedlnt" "optional "/> 
"xs :dateTime" "optional"/> 
"skip"/> 



<xs : complexType "AvgThroughputType" > 

<xs : attribute name="numBytes" "xs lunsignedlnt" "required"/> 

<xs : attribute name- "activityTime" "xs lunsignedlnt" "required"/> 

<xs : attribute name="t" "xs idateTime" "required"/> 

<xs : attribute name= "duration" "xs lunsignedlnt" "required"/> 

<xs : attribute name="accessbearer" "xs:string" "optional"/> 

<xs : attribute i: in:e-"inactivityType" "InactivityType" "optional"/> 

<xs : anyAttribute "skip"/> 

</xs : complexType> 

<xs : simpleType "InactivityType" > 
<xs : restriction "xs: string" > 
<xs : enumeration "Pause"/> 
<xs : enumeration "Buf f erControl"/> 
<xs : enumeration "Error" /> 
</xs : restriction> 
</xs : simpleType> 

"Buf f erLevelType" > 



<xs : complexType 
<xs : choice> 

<xs : element 
laxc:: "unbounded" /> 
</xs : choice> 

<xs : anyAttribute 
</xs : complexType> 

<xs : complexType 
<xs : attribute 
<xs : attribute 
<xs : anyAttribute 

</xs : complexType> 



"Buf ferLevelEntry" 



"skip"/; 



'Buf f erLevelEntryType" > 
"t" "xs idateTime" 
"level" "xs lunsignedlnt" 
essContentf "skip"/> 



"Buf ferLevelEntry Type" 



required" /> 

" required" /> 




<xs : complexType 
<xs : choice> 

<xs : element 
</xs : choice> 

<xs : anyAttribute 
</xs : complexType> 

<xs : complexType 



"PlayListType" > 

"Trace" "PlayListEntryType" 

"skip"/> 

"PlayListEntryType" > ^^^^^^^^^^^ 



" unbounded " / > 
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"TraceEntry" 



<xs : choice> 

<xs : element 
</xs : choice> 
<xs : attribute 
<xs : attribute i-a 
<xs : attribute !^ame = "startType" 
<xs : anyAttribute 
</xs : complexType> 



"PlayListTraceEntryType" 



"start" 
--"mstart" 



"xs idateTime" 
"xs lunsignedlnt" 

"StartType" 
"skip"/> 



" required" /> 

" required" /> 
" required" /> 



<xs : complexType 



"PlayListTraceEntryType" > 
<xs : attribute ii<aLh=-"representationId" "xsistring" "optional"/> 
<xs : attribute narT"='-"subrepLevel" "xs lunsignedlnt" "optional" /> 
"start" "xs idateTime" "required"/> 
"mstart" "xs lunsignedlnt" " required" /> 
"duration" "xs lunsignedlnt" "required"/> 
"playbackSpeed" "xsidouble" "optional"/> 
"stopReason" "StopReasonType" "optional"/> 
"skip"/> 



<xs : attribute 
<xs : attribute 
<xs : attribute 
<xs : attribute 
<xs : attribute 
<xs : anyAttribute 
</xs : complexType> 



<xs : simpleType "StartType" > 

<xs : restriction "xs: string" > 

<xs : enumeration "NewPlayoutRequest"/> 
<xs : enumeration "Resume" /> 
<xs : enumeration "OtherUserRequest"/> 
<xs : enumeration "StartOfMetricsCollectionPeriod"/> 
</xs : restriction> 
</xs : simpleType> 

<xs : simpleType "StopReasonType" > 
<xs : restriction "xs: string" > 



<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
<xs : enumeration 
</xs : restriction> 
</xs : simpleType> 



<xs : complexType 
<xs:choice> 

<xs : element 
</xs : choice> 
<xs : attribute 
<xs : attribute 
<xs : anyAttribute 

</xs : complexType> 

<xs : complexType 
<xs : attribute 
<xs : attribute 
<xs : attribute name 
<xs : attribute name 
<xs : attribute name 
<xs : attribute i- ir: 
<xs : attribute 
<xs : anyAttribute 

</xs : complexType> 



<xs : simpleType 

<xs : list 
</xs : simpleType> 

<xs : simpleType 

<xs : list 
</xs : simpleType> 



[</xs : schema> 




"RepresentationSwitch" /> 
"Rebuf f ering"/> 
"UserRequest " / > 
"UnicastToBroadcastSwitch"/> 
"BroadcastToUnicastSwitch" > 
"EndOf Period" /> 
"EndOf Content "/> 

"EndOfMetricsCollectionPeriod"/> 
"Failure"/> 



"Mpdinf ormationType" > 

"Mpdinfo" "Represent at ionType" 



"representationid" "xs : string" 

'subrepLevel" "xs lunsignedlnt" 

"skip"/> 



1 0.6.3 Reporting Protocols 



If a specific metrics server has been configured, the client shall send QoE reports using the HTTP (RFC 2616) [9] 
POST request carrying XML formatted metadata in its body. 
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An example QoE reporting based on HTTP POST request signalling is shown below: 



■post http://www.exampleserver.com HTTP/ 1.1 

Host: 192.68.1.1 

User-Agent: Mozilla/4.0 {compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0) 
LContent-Type : text/xml; charset=utf -8 

-ontent-Length: 4408 

l<?xml versions" 1 . 0" ?> 

|<receptionReport "http : //www. example . com/content/content .mpd" 

"urn: 3gpp : metadata : 2011 :HSD : receptionreport" > 
<qoeReport "Periodl" "2011-02-16T09 : 00 : 00" 

<qoeMetric> 

<HttpList> 

<HttpListEntry "MPD" "http : //www. example . com/content/content .mpd" 
"2011-02-16T08:59:30" "2011-02-16T08 : 59 : 31" /> 

<HttpListEntry " InitializationSegment " 
"http: //www. example.com/content/initRepl . 3gp" "2011-02-16T08 : 59 :40" 

"2011-02-16T08:59:41" /> 

<HttpListEntry "InitializationSegment" 
"http: //www. example.com/content/initRep2 . 3gp" "2 011-02-16T08 : 59 :41" 

|respo;.. "2011-02-16T08 : 59 : 42" /> 

<HttpListEntry "InitializationSegment" 
"http: //www. example.com/content/initRep3 . 3gp" "2011-02-16T08 : 59 :42" 

"2011-02-16T08:59:43" /> 
</HttpList> 
</qoeMetric> 
<qoeMetric> 

<InitialPlayoutDelay>10 0</InitialPlayoutDelay> 
</qoeMetric> 
</qoeReport> 

<qoeReport "Periodl" 

<qoeMetric> 

<Buf f erLevel> 

<Buf f erLevelEntry 
<Buf f erLevelEntry 
</Buf f erLevel> 
</qoeMetric> 
<qoeMetric> 

<RepSwitchList> 

<RepSwitchEvent 
<RepSwitchEvent 
</RepSwitchList> 
</qoeMetric> 
</qoeReport> 
l</receptionReport> 



"Rep2"/; 
"Rep3"/; 




"2011-02-16T09:08:20 



"2011-02-16T09:08:19" level. "84673"/> 
"2011-02-16T09:08:20" level. " 93 8 74 "/> 
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Annex A (informative): 
Example DASH Client Behaviour 

A.1 Introduction 

The information on client behaviour is purely informative and does not imply any normative procedures on DASH 
client implementations. 



A.2 Overview 



A 3GP-DASH client is guided by the information provided in the MPD. This example assumes that the MPD@type is 
'dynamic'. The behaviour in case MPD@type being 'static' is basically a subset of the description here. 

The description in this Annex assumes that the client has access to the MPD at time FetchTime, at its initial location if 
no MPD . Location element is present, or at a location specified in any present MPD . Location element. FetchTime 
is defined at the client as the time at which the server processes the request for the MPD, but should take into account 
delay due to MPD delivery and processing. The fetch is considered successful either if the client obtains an updated 
MPD or the client verifies that the MPD has not been updated since the previous fetching. 

The following example client behaviour is expected to provide a continuous streaming service to the user: 

1) The client parses the MPD, selects a set of Adaptation Sets suitable for its environment based on information 
provided in each of the AdaptationSet elements. The selection of Adaptation Sets may also take into 
account information provided by the AdaptationSet@group attribute. 

2) Within each Adaptation Set the client selects one specific Representation, typically based on the value of the 
©bandwidth attribute, but also taking into account client decoding and rendering capabilities. Then it creates a 
list of accessible Segments for each Representation for the actual client-local time NOW measured in wall-clock 
time taking into account the procedures introduced in clause A. 3. 

3) The client accesses the content by requesting Segments or byte ranges of Segments. The client requests the 
Media Segments of the selected Representations by using the generated Segment list. 

4) The client buffers media of for at least value of ©minBuf f erTime attribute duration before starting the 
presentation. Then, once identified a Stream Access Point (SAP) for each of the media streams in the different 
Representations, it starts rendering (in wall-clock-time) of this SAP not before 

MPD@availabilityStartTime + PeriodStart -hTsap and not after MPD@availabilityStartTime 
+ PeriodStart -hTsap + @timeShif tBuf f erDepth provided the observed throughput remains at or above 
the sum of the ©bandwidth attributes of the selected Representations (if not, longer buffering may be needed). 
For services with MPD©type= ' dynamic ' , rendering the SAP at the sum of PeriodStart -t-TsAP and the value 
of MPD©suggestedPresentationDelay is recommended, especially of synchronized play-out with other 
devices adhering to the same rule is desired. 

5) Once the presentation has started, the client continues consuming the media content by continuously requesting 
Media Segments or parts of Media Segments . The client may switch Representations taking into account 
updated MPD information and/or updated information from its environment, e.g. change of observed throughout. 
With any request for a Media Segment containing a Stream Access Point, the client may switch to a different 
Representation. Seamless switching can be achieved as the different Representations are time-aligned. 
Advantageous switching points are announced in the MPD and/or in the Segment Index, if provided. 

6) With the wall-clock time NOW advancing, the client consumes the available Segments. As NOW advances the 
client possibly expands the list of available Segments for each Representation according to the procedures 
specified in clause A. 3. If the following conditions are both true, an updated MPD should be fetched: 

a) the ©mediaPresentationDuration attribute is not declared or if any media described in the MPD 
does not reach to the end of the Media Presentation, and 
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b) the current playback time gets within a threshold (typically described by at least the sum of the value of the 
©minBuf f erTime attribute and the value of the ©duration attribute on Representation level) of the 
media described in the MPD for any consuming or to be consumed Representation. 

7) If the clauses in 6) are true, the client should fetch a new MPD and update fetch time FetchTime. Once received 
the client now takes into account the possibly updated MPD and the new FetchTime in the generation of the 
accessible Segment Lists. 

In the following a brief overview on Segment list generation, seeking, support for trick modes and switching 
Representations are provided. 



A.3 Segment List Generation 
A.3.1 General 

Assume that the 3GP-DASH client has access to an MPD. This clause describes how a client may generate a Segment 
list for one Representation as shown in Table A.l from an MPD obtained at FetchTime at a specific client-local time 
NOW. In this description, the term NOW is used to refer to 'the current value of the clock at the reference client when 
performing the construction of an MPD Instance from an MPD'. A client that is not synchronised with a DASH server, 
which is in turn is expected to be synchronised to UTC, may experience issues in accessing Segments as the Segment 
availability times provided by the server and the local time NOW may not be synchronized. Therefore, 3GP-DASH 
cUents are expected to synchronize their clocks to a globally accurate time standard. 

Table A.l : Segment List 



Parameter Name 


Cardinality 


Description 


Segments 


1 


Provides the Segment URL list. 


InitializationSegment 


0, 1 


Describes tlie Initialization Segment. If not present each Media Segment is 
self-initializing. 


URL 


1 


The URL where to access the Initialization Segment (the client would 
restrict the URL with a byte range if one is provided in the MPD). 


MediaSegment 


1 ... N 


Describes the accessible Media Segments. 


startlime 


1 


The MPD start time of the Media Segment in the Period relative to the start 
time of Period. 


duration 


1 


The MPD duration for the Segment 


URL 


1 


The URL where to access the Media Segment possibly combined with a 
byte range. 



According to 8.4.4 there exist three different ways to describe and generate a Segment List. This description focusses on 
the first two where either a SegmentList element or a SegmentTemplate element is present. The case with a 
single Media Segment using BaseURL element and SegmentBase element is considered straightforward. 

The following rules apply to derive the Segment Information: 

If the Representation contains or inherits a SegmentTemplate element, then the procedures in clause A. 3.2 
are used to generate a list of Media Segments. 

If the Representation contains one or more SegmentList elements, providing a set of explicit URL(s) for 
Media Segments, then the procedures in clause A. 3. 3 are used to generate a list of Media Segments. 

If the MPD@type is 'dynamic ' , then the restrictions on Media Segment Lists as provided in clause A. 3.4 need 
to be taken into account. 

The client should only request Segments that are included in the Segment List at time instant NOW, i.e. Segment 
that are available at this time instant. 

A.3. 2 Template-based Generation of Media Segment List 

If the Representation contains or inherits a SegmentTemplate element, then the procedures in this clause are used to 
generate a list of Media Segment parameters, i.e. Segment URLs, MPD start times and MPD durations. 
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Assume that the Period end time documented in the current MPD with fetch time FetchTime is defined as 
PeriodEndTime. For any Period in the MPD except for the last one, the PeriodEndTime is obtained as the value of the 
PeriodStart of the next Period. For the last Period in the MPD: 

if the ©mediaPresentationDuration attribute is present, then PeriodEndTime is defined as the end time 
of the Media Presentation. 

if the ©mediaPresentationDuration attribute is not present, then PeriodEndTime is defined as 
FetchTime + ©minimumUpdatePeriod. 

For the SegmentTemplate element, the relevant identifiers are replaced in the SegmentTemplate@media. 

Assume that Media Segments within a Representation have been assigned consecutive numbers /=@startNumber , 
@startNuinber+l, ...., i.e. the first Media Segment has been assigned the index /=!, the second Media Segment has 
been assigned the index /=2, and so on. 

A valid list of Media Segments with Segment indices /, MediaSegment.StartTime[/] and MediaSegment.URL[/], 
/=@startNumber, @startNumber+l, ..., is obtained as follows using the ©duration attribute for this 
Representation: 

1) Set /=@startNumber. 

2) The MPD start time of the first Media Segment is obtained as (©startNumber -1)* ©duration, i.e. 
MediaSegment.StartTime[i] = 0. 

3) The URL of the Media Segment /, MediaSegment.URL[/], is obtained by replacing the %Number$ identifier by / 
in the string of SegmentTemplate@media. 

4) If {PeriodStart + MediaSegment.StartTime[/] + ©duration) < PeriodEndTime) 

- then 

- A new Media Segment is added to the list, i.e. / = / + 1; 

- The MPD duration is set to MediaSegment.duration[/] = ©duration 

- The MPD start time is MediaSegment.StartTime[/] = MediaSegment.StartTime[/-l] + ©duration. 

- Proceed with step 3. 

- else 

- A new Media Segment is added to the list, i.e. / = / + 1; 

- The MPD start time is MediaSegment.StartTime[!] = MediaSegment.StartTime[i-l] + ©duration. 

- The guaranteed duration is set to MediaSegment.duration[i] = PeriodEndTime - 

MediaSegment.StartTime[/] 

- The restrictions as specified in clause A.3.4 are applied for the creation of the accessible list of Media 

Segments. 

A.3.3 Playlist-based Generation of Media Segment List 

If the Representation contains or inherits one or more SegmentList elements, then the procedures specified in this 
clause apply to generate a valid list of accessible Media Segment URLs and MPD start times. 

Assume that Media Segments within a Representation have been assigned consecutive indices /=1,2,3...., i.e. the first 
Media Segment has been assigned /=!, the second Media Segment has been assigned /=2, and so on. 

A valid list of Media Segments with Segment numbers /=©startNumber, ©startNumber +1, ..., 
MediaSegment.StartTime[/] and MediaSegment.URL[i] is obtained as follows: 

1) Set i=©startNumber. 
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2) The MPD start time of the first Media Segment is 0, i.e. MediaSegment.StartTime[i] = 

3) The URL of the Media Segment /, MediaSegment.URL[/], is obtained as the SegmentURL@media attribute of 
the (/-@startlndex +l)-th SegmentUrl element in the SegmentList element taking into account URI 
reference resolution, possibly using the byte range specified in the ©media attribute of the same SegmentUrl 
element, if present. 

3) If the ©duration attribute is provided, then the MediaSegment.StartTime[/] of Media Segment / is obtained as 
(/-@startNumber-l)*@duration. If the ©duration attribute is not provided, then the 
MediaSegment.StartTime[l] of the only provided Segment is set to 0. 

4) If this is not the last SegmentUrl element, a new Media Segment is added to the list, i.e. / = / + 1, and proceed 
with step 2; Otherwise proceed with step 5. 

5) The restrictions as specified in clause A. 3. 4 are applied for the creation of the accessible list of Media Segments. 



A.3.4 Media Segment List Restrictions 



The Media Segment List is restricted to a list of accessible Media Segments, which may be a subset of the Media 
Segments of the complete Media Presentation. The construction is governed by the current value of the clock at the 
chent NOW. 

Generally, Segments are only available for any time NOW between ©availabilityStartTime and 
©availabilityEndTime. For times A^OW outside this window, no Segments are available. 

In addition, for services with MPD@type= 'dynamic ', assume the variable CheckTime associated to an the MPD 
with FetchTime is defined as: 

If the @minimumUpdatePeriod attribute is provided, then the check time is defined as the sum of the fetch 
time of this operating MPD and the value of this attribute, i.e. CheckTime = FetchTime + 

©minimumUpdate Period. 

If the @minimumUpdatePeriod attribute is not provided, external means are used to determine CheckTime, 
such as a priori knowledge, or HTTP cache headers, etc. 

The CheckTime is defined on the MPD-documented media time axis; when the client"s playback time reaches or gets 
close to CheckTime - MPD@minBuf f erTime it should fetch a new MPD. 

Then, the Media Segment list is further restricted by the CheckTime together with the MPD attribute 
MPD@timeShif tBuf f erDepth such that only Media Segments for which the sum of the start time of the Media 
Segment and the Period start time falls in the interval [NOW - MPD@timeShif tBuf f erDepth - 
Segmentinf o@duration, imn{CheckTime, NOW)] are included. 



A.4 Seeking 



Assume that a client attempts to seek to a specific Media Presentation time Tm in a Representation relative to the 
PeriodStart time. According to 9.4. L2, the presentation times within each Period are relative to the PeriodStart time of 
the Period minus the value of the @presentationTimeOf f set, Tq, of the containing Representation. 

Based on the MPD, the client has access to the MPD start time and Media Segment URL of each Segment in the 
Representation. The Segment number of the Segment most likely to contain media samples for Media Presentation time 
I'm is obtained as the maximum Segment index /*, for which the MPD start time MediaSegment[/].StartTime is smaller 
or equal to Tm and the start of the retrieved Segment is always available. 

Note that timing information in the MPD may be approximate due to issues related to placement of Stream Access 
Points, alignment of media tracks and media timing drift. As a result, the Segment identified by the procedure above 
may begin at a time slightly after tp and the media data for presentation time Tu may be in the previous Media Segment. 
In case of seeking, either the seek time may be updated to equal the first sample time of the retrieved file, or the 
preceeding file may be retrieved instead. However, note that during continuous playout, including cases where there is a 
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switch between alternative versions, the media data for the time between T^ and the start of the retrieved Segment is 
always available. 

For accurate seeking to a presentation time Tm, the 3GP-DASH Client needs to access Stream Access Points (SAPs). To 
determine the SAPs in a Media Segment in case of 3GP-DASH, the client may, for example, use the information in the 
Segment Index if present to locate the stream access points and the corresponding presentation time in the Media 
Presentation. In the case that a Segment is a 3GPP movie fragment, it is also possible for the client to use information 
within the "moof" and "mdat" boxes, for example, to locate SAPs and obtain the necessary presentation time from the 
information in the movie fragment and the Segment start time derived from the MPD. If no SAP with presentation time 
before the requested presentation time Tm is available, the client may either access the previous Segment or may just use 
the first representation access point as the seek result. When Media Segments start with a SAP, these procedures are 
simplified. 

Also note that not necessarily all information of the Media Segment needs to be downloaded to access the presentation 
time Ty[. The client may for example initially request the Segment Index from the beginning of the Media Segment 
using byte range requests. By use of the Segment Index, Segment timing can be mapped to byte ranges of the Segment. 
By continuously using HTTP partial GET requests, only the relevant parts of the Media Segment may be accessed for 
improved user experience and low start-up delays. 



A.5 Support for Trick Modes 



The client may pause or stop a Media Presentation. In this case client simply stops requesting Media Segments or parts 
thereof. To resume, the client sends requests to Media Segments, starting with the next fragment after the last requested 
fragment. 

If a specific Representation or SubRepresentation element includes the ©max P layout Rate attribute, 
then this Representation or Sub-representation may be used for the fast-forward trick mode. The client may play the 
Representation or Sub-Representation with any speed up to the regular speed times the specified OmaxPlayoutRate 
attribute with the same decoder profile and level requirements as the normal playout rate. If a specific 
Representation or SubRepresentation element includes the ©codingDependency attribute with value 
set to 'false ' , then this Representation or Sub -representation may be used for both fast-forward and fast-rewind trick 
modes. 

Sub-Representations in combination with Index Segments and Subsegment Index boxes may be used for efficient trick 
mode implementation. Given a Sub-Representation with the desired ©maxPlayoutRate, ranges corresponding to 
SubRepresentation@level all level values from SubRepresentation@dependencyLevel maybe 
extracted via byte ranges constructed from the information in Subsegment Index Box. These ranges can be used to 
construct more compact HTTP GET requests. 

The client may use multiple Representations to support trick mode behaviour. 



A. 6 Switching Representations 



Based on updated information during an ongoing Media Presentation, a client may decide to switch Representations. 
Switching to a 'new' Representation is equivalent to tuning in or seeking to the new Representation from the time point 
where the "old" Representation has been presented. Once switching is desired, the client should seek to a SAP in the 
'new' Representation at a desired presentation time T^ later than and close to the current presentation time. Presenting 
the 'old' Representation up to the SAP in the 'new' Representation enables seamless switching. 

If ©segmentAligment is set true and the @startWithSAP is set to 1, 2 or 3 (and in the latter case the 
Representation@inediaStreamStructureId is identical for the two Representations), then the client may 
switch at any Segment boundary by just concatenating Segments with consecutive indices from different 
Representations. No overlap downloading and decoding is required. 

The same can be achieved on Subsegment level with ©subsegment Alignment set true and 
@subsegmentStartWithSAP the same values and conditions as above. 
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A. 7 Reaction to Error Codes 



The HTTP Streaming client provides a streaming service to the user by issuing HTTP requests for Segments at 
appropriate times. The HTTP Streaming client may also update the MPD by using HTTP requests. In regular operation 
mode, the server typically responds to such requests with status code 200 OK (for regular GET) or status code 206 
Partial Content (for partial GET) and the entity corresponding to the requested resource. Other Successful 2xx or 
Redirection 3xx status codes may be returned. 

HTTP requests may result in a Client Error 4xx or Server Error 5xx status code. Some guidelines are provided in this 
clause as to how an HTTP client may react to such error codes. 

If the HTTP Client receives an HTTP client or server error (i.e. messages with 4xx or 5xx error code), the client should 
respond appropriately to the error code. 

If the HTTP Client receives a repeated HTTP error for the request of an MPD, the appropriate response may involve 
terminating the streaming service. 

If the HTTP Client receives an HTTP client error (i.e. messages with 4xx error code) for the request of an Initialization 
Segment, the Period containing the Initialization Segment may not be available anymore or may not be available yet. In 
this case the client should check if the precision of the time synchronization to a globally accurate time standard is 
sufficiently accurate. In case of repeated errors, the client should check for an update of the MPD. 

If the HTTP Client receives an HTTP client error (i.e. messages with 4xx error code) for the request of a Media 
Segment, the requested Media Segment may not be available anymore or may not be available yet. In this case the 
client should check if the precision of the time synchronization to a globally accurate time standard is sufficiently 
accurate. In case of repeated errors, the client should check for an update of the MPD. 

Upon receiving server errors (i.e. messages with 5xx error code), the client should check for an update of the MPD. The 
client may also check for alternative representations that are hosted on a different server. 



A. 8 Encoder Clock Drift Control 

Non-alignment between the end of a Representation in one Period and the start time of the next Period may be caused 
by encoder clock inaccuracy. The client should align the media presentation time at each Period start. In addition, 
significant deviations of the start time of Segments to the media time should be detected and drift-compensating 
measures may be applied even before the start of the next period is reached. 

Over a longer operation time, a difference in clock accuracy of the encoder and decoder may cause the playback to lag 
behind real-time or to interrupt temporarily due to the client trying to access data faster than real-time. Clients may 
avoid these anomalies by using the Producer Reference Time boxes as defined in clause G.5 as follows. The pace rl of 
the encoder clock in relation to the UTC is recovered from Producer Reference Time boxes. If the relative pace rl is 
less than 1, equal to 1, or greater than 1, the encoder clock runs more slowly than the UTC, at an identical pace 
compared to the UTC, or faster than the UTC, respectively. The pace r2 of the receiver playout clock in relation to UTC 
is created by accessing a UTC source. A timescale multiplication factor c is equal to rl/r2. A presentation time on a 
timeline of the receiver playout clock is derived for each sample or access unit by multiplying the composition time of 
the sample (as indicated by the file format structures) or the presentation time of the access unit (as indicated by the 
respective Program Elementary Stream header) by the timescale multiplication factor c. 
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Annex B (normative): 

Media Presentation Description Schema 

B.1 Introduction 

The main schema is provided in Annex B.2 in Table B-1. The main schema refers to the extension schema in Annex 
B.3 and in section 8.3. 

B.2 IVIain Schema 

Table B-1 : XML schema of the MPD 



<?xml version="l . 0" ?> 

<xs : schema targetNamespace= "urn :mpeg: DASH: schema :MPD : 2011" 

:,:tributeFormDefault= "unqualified" 

elementFormDefault= "qualified" 

xmlns :xs=" http://www.w3 . org/2 01/XMLSchema" 

xmlns :xlink="http : //www.w3 .org/1999/xlink" 

xmlns :x3gpp="urn:3GPP:ns : DASH: MPD- ext :2011" 

xmlns = "urn : mpeg : DASH : schema : MPD : 2 11 " > 

<xs : import namespace="http : //www.w3 .org/1999/xlink" schemaLocation=" xlink.xsd"/> 
<xs : import naiTiespace = "urn: 3GPP :ns : DASH: MPD- ext : 2011" schemaLocation="3gpp-2 011 .xsd"/> 

<xs : annotation> 

<xs : appinfoMedia Presentation Description</xs : appinfo 

<xs : documentation "en"> 

This Schema defines the Media Presentation Description. 

</xs : documentation> 
</xs : annotation> 

<!-- MPD: main element --> 

<xs:element :.,>.. "MPD" type="MPDtype"/> 

< ! - - MPD Type - - > 

<xs : complexType name="MPDtype" > 

<xs : sequence> 

<xs:element name="ProgramInformation" "ProgramlnformationType" minOccurs="0" 
maxOc ' " unbounded" /> 

<xs:element name="BasetJRL" type="BaseURLType" minOccurs="0" maxOccurs= "unbounded" /> 

<xs:element name= "Location" type="xs : anyURI" minOccurs="0" maxOccurs="unbounded"/> 

<xs: element name="Period" type="PeriodType" maxOccurs= "unbounded" /> 

<xs:element name= "Metrics" type="MetricsType" minOccurs="0" maxOccurs= "unbounded" /> 

<xs:element ref ="x3gpp :DeltaSupport" minOccurs="0"/> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccuis="unbounded"/> 

</xs : sequence> 

<xs : attribute name="id" type="xs :string"/> 

<xs : attribute name="prof iles" type="xs : string" i .= ■v-"required"/> 

<xs : attribute name="type" type = "PresentationType" ;lt_'-i,_ -"static"/> 

<xs : attribute name="availabilityStartTime" type="xs :dateTime"/> 

<xs : attribute name="availabilityEndTime" Lyije = "xs :dateTime"/> 

<xs : attribute name="mediaPresentationDuration" type="xs : duration" /> 

<xs : attribute name="minimumUpdatePeriod" type="xs : duration" /> 

<xs : attribute name="minBuf f erTime" _ype="xs : duration" use="required"/> 

<xs : attribute name="timeShif tBuf f erDepth" type="xs : duration" /> 

<xs : attribute name="suggestedPresentationDelay" type="xs : duration" /> 

<xs : attribute name="maxSegmentDuration" type="xs :duration"/> 

<xs : attribute name="maxSubsegmentDuration" type="xs : duration" /> 

<xs : anyAt tribute namespace="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Presentation Type enumeration --> 
<xs : simpleType name="PresentationType" > 
<xs : restriction base="xs : string" > 
<xs : enumeration value=" static "/> 
<xs : enumeration value= "dynamic "/> 
</xs : restriction> 
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</xs : simpleType> 

< ! -- Period --> 

<xs : complexType name =" Per iodType" > 
<xs : sequence> 

<xs:element name = "BaseURL" ::ype="BaseURLType" niinOccurs = "0" maxOccurs= "unbounded" /> 
<xs:element name="SegmentBase" type="SegmentBaseType" minOccurs="0"/> 
<xs:element name="SegmentList" type="SegmentListType" minOccurs="0"/> 
<xs : element name="SegmentTemplate" * yr :-;v^"SegmentTemplateType" minOccurs = "0"/> 
<xs:element name="AdaptationSet" r yi:.e = "AdaptationSetType" minOccurs="0" 
maxOc cur s = " unbounded " / > 

<xs : any namespace="##other" processContents="lax" minOccurs="0" 
maxOccurs= "unbounded" /> 
</xs : sequence> 

<xs : attribute ret ="xlink: href "/> 

<xs : attribute ref = "xlink: actuate" def ault=:"onRequest"/> 
<xs : attribute name="id" type="xs : string" /> 
<xs : attribute name="start" type="xs : duration" /> 
<xs : attribute name= "duration" type="xs :duration"/> 

<xs : attribute name="bitstreamSwitching" type="xs : boolean" def ault="f alse"/> 
<xs : anyAt tribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Adaptation Set --> 

<xs : complexType riame= "AdaptationSetType" > 
<xs : complexContent> 

<xs : extension base^ "RepresentationBaseType" > 
<xs : sequence> 

<xs:element name="Accessibility" type="DescriptorType" minOccurs="0" 
maxOc cur s ^ " unbounded " / > 

<xs:element name="Role" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs:element name="Rating" type="DescriptorType" minOccurs="0" 
maxOccurs^ "unbounded"/> 

<xs:element name= "Viewpoint" type="DescriptorType" minOccurs="0" 
maxOccurs- "unbounded"/> 

<xs:element name = "ContentComponent" typ'e="ContentComponentType" minOccurs="0" 
maxOccurs- "unbounded"/> 

<xs:element name="BaseURL" Lyp'e = "BaseURLType" minOccurs="0" maxOccurs^ "unbounded" /> 
<xs:element name="SegmentBase" type="SegmentBaseType" minOccurs="0"/> 
<xs:element name="SegmentList" type="SegmentListType" minOccurs="0"/> 
<xs:element name="SegmentTemplate" type="SegmentTemplateType" minOccurs="0"/> 
<xs:element name="Representation" type="RepresentationType" minOccurs="0" 
maxOccurs "unbounded" /> 
</xs : sequence> 

<xs : attribute ref ="xl ink: href" /> 

<xs : attribute ref ="xlink: actuate" def ault-"onRequest"/> 
<xs : attribute name="id" type="xs :unsignedInt"/> 
<xs : attribute name="group" type="xs :unsignedInt"/> 
<xs : attribute name="lang" type = "xs ■.language"/> 
<xs lattribute name="contentType" type="xs : string" /> 
<xs : attribute name="minBandwidth" type="xs :unsignedInt"/> 
<xs : attribute name="maxBandwidth" type="xs :unsignedInt"/> 
<xs : attribute name="minWidth" type="xs :unsignedInt"/> 
<xs : attribute name="maxWidth" type="xs :unsignedInt"/> 
<xs : attribute name="minHeight" type="xs :unsignedInt"/> 
<xs : attribute name="maxHeight" type="xs :unsignedInt"/> 
<xs : attribute name="minFrameRate" type="FrameRateType"/> 
<xs : attribute name="maxFrameRate" type="FrameRateType"/> 

<xs : attribute name="segmentAlignment" type="ConditionalUintType" def ault="f alse"/> 
<xs : attribute name="subsegmentAlignment" type="ConditionalUintType" def ault="f alse"/> 
<xs :attribute name="subsegmentStartsWithSAP" type="SAPType" def ault="0"/> 
<xs : attribute nanie = "bitstreamSwitching" type = "xs : boolean" /> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType > 

<!-- Type for Frame Rate --> 

<xs : simpleType ..^...^-"FrameRateType" > 

<xs : restriction base="xs : string" > 

<xs:pattern ..= ii;=-" [0-9] * [0-9] (/ [0-9] * [0-9] ) ?"/> 

</xs : restriction> 
</xs : simpleType> 

<!-- Conditional Unsigned Integer {unsignedint or boolean) --> 
<xs : simpleType name="ConditionalUintType" > 

<xs: union ■^■■'mberTypes="xs : unsignedint xs : boolean" /> 
</xs : simpleType> 
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<!-- Content Component --> 

<xs : complexType name="ContentComponentType" > 
<xs : sequence> 

<xs:element name= "Accessibility" typ'e="DescriptorType" minOccurs = "0" 
maxOccurt " unbounded" /> 

<xs:element name="Role" type="DescriptorType" minOccurs="0" maxOccurs= "unbounded" /> 
<xs:element name="Rating" type="DescriptorType" minOccurs="0" maxOccurs="unbounded"/> 
<xs:element name= "Viewpoint" type="DescriptorType" minOccurs="0" 
:;urs = "unbounded" /> 
<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 
</xs : sequence> 

<xs : attribute name = "id" typ'e="xs :unsignedInt"/> 
<xs : attribute name="lang" type="xs : language"/> 
<xs : attribute name="contentType" type="xs : string"/> 
<xs : anyAt tribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Representation --> 

<xs : complexType riaine = "RepresentationType" > 
<xs : complexContent> 

<xs : extension base- "RepresentationBaseType" > 
<xs : sequence> 

<xs:element name = "BaseURL" ':.ype="BaseURLType" minOccurs = "0" maxOccurs= "unbounded" /> 
<xs:element name="SubRepresentation" type="SubRepresentationType" minOccurs="0" 
maxOccurs^ "unbounded"/> 

<xs:element name = "SegmentBase" tyj:ie="SegmentBaseType" minOccurs = "0"/> 
<xs: element name="SegmentList" type="SegmentListType" minOccurs="0"/> 
<xs:element name="SegmentTemplate" type="SegmentTemplateType" minOccurs="0"/> 
</xs : sequence> 

<xs : attribute name="id" type="StringNoWhitespaceType" use= "required" /> 
<xs : attribute name = "bandwidth" type = "xs :unsignedlnt" i.::: ;-"required"/> 
<xs : attribute name="qualityRanking" type="xs :unsignedInt"/> 
<xs : attribute name="mediaStreamStructureId" ■ _ -"StringVectorType"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- String without white spaces --> 

<xs : simpleType name="StringNoWhitespaceType" > 

<xs : restriction base= "xs : string" > 

<xs:pattern " [^\r\n\t \p{z}]*"/> 

</xs : restriction> 
</xs : simpleType> 

<!-- SubRepresentation --> 

<xs : complexType name="SubRepresentationType" > 
<xs : complexContent> 

<xs : extension base= "RepresentationBaseType" > 

<xs : attribute name="level" "xs :unsignedInt"/> 
<xs : attribute name="dependencyLevel" type="UIntVectorType"/> 
<xs : attribute name= "bandwidth" type="xs :unsignedInt"/> 
<xs : attribute name =" content Component" type="StringVectorType"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType > 

<!-- Representation base {common attributes and elements) --> 
<xs : complexType ..j..;.^-" RepresentationBaseType" > 
<xs : sequence> 

<xs:element name="FramePacking" ^ r^. ^noescriptorType" minOccurs="0" 
maxOccurf "unbounded" /> 

<xs:element naiv,e^"AudioChannelConf iguration" -"DescriptorType" minOccurs = "0" 
maxOccurf "unbounded"/> 

<xs:element rianie="ContentProtection" ;,"^"pv;i "DescriptorType" minOccurs = "0" 

" unbounded" / > 
<xs : any naniespace = "##other" processContents=i"lax" minOccurs = "0" maxOccurs= "unbounded" /> 
</xs : sequence> 

<xs : attribute name="prof iles" type="xs : string"/> 
<xs : attribute name="width" r /; -.-"xs :unsignedInt"/> 
<xs : attribute name="height" typ6="xs :unsignedInt"/> 
<xs : attribute name="f rameRate" type="FrameRateType"/> 
<xs : attribute name="audioSamplingRate" typ'e="xs : string" /> 
<xs : attribute name="mimeType" type="xs : string" /> 
<xs : attribute name="codecs" ;ype="xs : string"/> 
<xs : attribute name="maximumSAPPeriod" type="xs :double"/> 
<xs lattribute name="startWithSAP" _ ype="SAPType"/> 
<xs : attribute name="maxPlayoutRate" type="xs :double"/> 
<xs : attribute name="codingDependency" ^\"-'^-"xs : boolean" /> 
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<xs : anyAttribute namespace="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Stream Access Point type enumeration --> 
<xs : simpleType name="SAPType"> 

<xs : restriction base = "xs runsignedint" > 
<xs iminlnclusive - \\\ ' -"0" /> 
<xs imaxinclusive ■iL4=-"6"/> 
</xs : restriction> 
</xs : simpleType> 

<!-- Segment information base --> 

<xs : complexType name="SegmentBaseType" > 

<xs : sequence> 

<xs:element name = "Initialization" _. y |.ie="URLType" minOccurs="0"/> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 

</xs : sequence> 

<xs : attribute name="timescale" ' ■;-- -"xs :unsignedInt"/> 

<xs : attribute name="presentationTimeOf f set" type="xs :unsignedInt"/> 

<xs : attribute name="indexRange" type="xs : string"/> 

<xs : attribute name="indexRangeExact" type="xs : boolean" def ault="f alse"/> 

<xs : anyAttribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Multiple Segment information base --> 
<xs : complexType i ;ur- - "MultipleSegmentBaseType" > 
<xs : complexContent> 

<xs : extension base="SegmentBaseType"> 

<xs : attribute name = "duration" Lyp'e="xs :unsignedInt"/> 
<xs : attribute name="startNumber" type="xs :unsignedInt"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- Segment Info item URL/range --> 
<xs : complexType name="URLType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 

</xs : sequence> 

<xs : attribute name="sourceURL" type="xs : anyURI"/> 

<xs : attribute name="range" type="xs : string"/> 

<xs : anyAttribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Segment List --> 

<xs : complexType r an .^-"SegmentListType" > 
<xs : complexContent> 

<xs : extension base= "MultipleSegmentBaseType" > 
<xs : sequence> 

<xs:element name="SegmentURL" _ ' - "SegmentURLType" minOccurs="0" 

"unbounded" /> 

</xs : sequence> 

<xs : attribute ref ="xl ink: href" /> 

<xs : attribute ref = "xlink: actuate" def ault =^"onRequest"/> 
</xs : extension> 
</xs : complexContent> 
</xs : complexType> 

<!-- Segment URL --> 

<xs : complexType name="SegmentURLType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 

</xs : sequence> 

<xs : attribute name="media" ;ype="xs : anyURI"/> 

<xs : attribute name="mediaRange" type="xs : string"/> 

<xs : attribute name="indexRange" type="xs : string"/> 

<xs : anyAttribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

<!-- Segment Template --> 

<xs : complexType "SegmentTemplateType" > 
<xs : complexContent> 

<xs : extension „a3e= "MultipleSegmentBaseType" > 
<xs : attribute name="media" _ ype="xs : string"/> 
<xs : attribute r?ne = " initialization" tyr>e=i"xs : string" /> 
</xs : extension> 
</xs : complexContent> 
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</xs : complexType> 

<!-- Whitespace-separated list of strings --> 
<xs : simpleType iiame="StringVectorType" > 

<xs : list itemType="xs : string" /> 
</xs : simpleType> 

<!-- Whitespace-separated list of unsigned integers --> 
<xs : simpleType name="UIntVectorType" > 

<xs : list itemType="xs :unsignedInt"/> 
</xs : simpleType> 

< ! -- Base URL --> 

<xs : complexType name= "BaseURLType" > 
<xs : simpleContent> 

<xs : extension base= "xs : anyURI " > 

<xs : attribute name="serviceLocation" type="xs ; string" /> 
<xs : anyAt tribute namespace="##other" processContents="lax"/> 
</xs : extension> 
</xs : simpleContent> 
</xs : complexType> 

<!-- Program Information --> 

<xs : complexType name="ProgramInformationType" > 
<xs : sequence> 

<xs:element name="Title" type="xs : string" minOccurs="0"/> 
<xs:element name="Source" type="xs : string" minOccurs="0"/> 
<xs:element name=" Copyright" type="xs : string" minOccurs "0"/> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs="unbounded"/> 
</xs : sequence> 

<xs : attribute name="lang" type="xs : language"/> 
<xs : attribute name="moreInformationURL" type="xs : anyURI"/> 

<xs : anyAttribute namespace= "##other" processContents= " lax" /> 
</xs : complexType> 

<!-- Descriptor --> 

<xs : complexType name="DescriptorType" > 

<xs : sequence> 

<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 

</xs : sequence> 

<xs : attribute name="schemeIdUri" type="xs : anyURI" use="required"/> 

<xs : attribute name="value" type="xs : string"/> 

<xs : anyAttribute name space ="##other" processContents="lax"/> 
</xs : complexType> 

< ! -- Metrics --> 

<xs : complexType name="MetricsType" > 
<xs : sequence> 

<xs: element name = "Reporting" type="DescriptorType" maxOccurs= "unbounded" /> 
<xs:element name="Range" :ype="RangeType" minOccurs="0" maxOccurs= "unbounded" /> 
<xs : any namespace="##other" processContents="lax" minOccurs="0" maxOccurs= "unbounded" /> 
</xs : sequence> 

<xs : attribute name= "metrics" type="xs : string" use="required"/> 
<xs : anyAttribute name space = "##other" P'rocessContents="lax"/> 
</xs : complexType> 

<!-- Metrics Range --> 

<xs : complexType name= "RangeType" > 

<xs : attribute name="starttime" type="xs : duration" /> 

<xs : attribute namp= "duration" type="xs :duration"/> 
</xs : complexType> 

</xs : schema> 



B.3 3GPP Extension Schema 

Table B-2: XML schema of the 3GPP Extensions for MPD 



<?xml version="l . 0"?> 

<xs : schema targetNamespac- -"urn: 3GPP :ns : DASH: MPD- ext :2 011" 

attributeFormDefault= "unqualified" elementFormDefault= "qualified" 

xmlns :xs = "http: //www.w3 . org/2001/XMLSchema" 

xmlns = "urn : 3GPP : ns : DASH : MPD- ext : 2 11 " > 

<xs : annotation> 
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<xs : appinfo>Extensions to Media Presentation Description for 3GPP</xs : appinfo> 
</xs : annotation> 

<xs:element - - "DeltaSupport" : yt-t; = "DeltaSupportType"/> 

< ! --DeltaSupport for the MPD --> 

<xs : complexType i ri!;: "DeltaSupportType"> 
<xs : sequence> 

<xs : any riamespace="##other" processContents="lax" minOccurs="0" 
maxOc cur s = " unbounded " / > 
</xs : sequence> 

<xs : attribute name="sourceURL" type="xs : anyURI" us>: = "required"/> 
<xs : attribute name="availabilityDuration" type="xs .- duration" /> 
<xs : anyAt tribute name space ="##other" processContents="lax"/> 

</xs : complexType > 

</xs : schema> 
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Annex C (normative): 
Descriptor Scheme Definitions 

C.1 Introduction 

This annex defines descriptors that are defined in this specification. In particular the following descriptors are defined 
Role descriptor scheme in clause C.2. 
Frame packing descriptor scheme in clause C.3. 



C.2 Role Descriptor Scheme 



The URN "urn:inpeg : dash: role : 2 011" is defined to identify the role scheme defined in Table C.l. Note that 
Role@value shall be assigned to Adaptation Sets that contain a media component type to which this role is 
associated. 

Table C.I — Role@value attribute for scheme with a value "urn:mpeg:dash:roie:20ii" 



RoleSvalue 


Description 


caption 


captions (see note 3 below) 


subtitle 


subtides (see note 3 below) 


main 


main media component(s) which is/are intended for presentation if no other information is 
provided 


alternate 


media content component(s) that is/are an alternative to (a) main media content component(s) 
of the same media component type (see note 2 below) 


supplementary 


media content component that is supplementary to a media content component of a different 
media component type (see Note 1 below) 


commentary 


media content component with commentary (e.g. director"s commentary) (typically audio) 


dub 


media content component which is presented in a different language from the original (e.g. 
dubbed audio, translated captions) 


NOTES 

1) A normal audio/video program labels both the primary audio and video as "main". However, when the two 

media component types are not equally important, for example (a) video providing a pleasant visual 
experience to accompany a music track that is the primary content or (b) ambient audio accompanying a 
video showing a live scene such as a sports event, that is the primary content, the accompanying media 
maybe assigned a "supplementary" role. 

2) alternate media content components should carry other descriptors to indicate in what way it differs from the 

main media content components (e.g. a Viewpoint descriptor or a Role descriptor), especially when 
multiple alternate media content components including multiple supplementary media content 
components are available. 

3) open ('burned in') captions or subtitles would be marked as media type component "video" only, but having a 

descriptor saying 'caption' or 'subtitle'; 
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C.3 Frame Packing Descriptor Scheme 

The frame packing description scheme is signalled in FramePacking elements. For Representations or Sub- 
Representations that contain a video component that conforms to ISO/IEC 14496-10 [35], the URN for 
FramePacking@scheTneIdUri shall be 

urn:Tnpeg :dash:14496:10: f raTne_packing_arrangement_type : 2011 , that is defined to indicate the 
frame-packing arrangement as defined by Table D-8 of ISO/IEC 14496-10 [35] ("Definition of 
frame_packing_arrangement_type") to be contained in FramePacking elements. The ©value shall be the "Value" 
column as specified in Table D-8 of [35] and shall be interpreted according to the "Interpretation" column in the same 
table. A 3GP-DASH client supporting stereoscopic 3D video should recognize and support frame packing arrangement 
types given by values 3 and 4 for the ©value attribute, corresponding to Side-by-Side and Top-and-Bottom frame 
packing formats, respectively. 
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Annex D (informative): 
MPD Examples 



D.1 On-Demand Service 

Table D.l provides an example MPD for an On-Demand service. 

Table D.l : Example MPD for an On-Demand Service 



<?xml version="l . 0" ?> 
<MPD 

jrof iles= "urn : 3GPP : PSS : profile : DASHIO " 

type="static" 

minBuf ferTime="PT10S" 

mediaPresentationDuration="PT2H" 

availabilityStartTime="2010-04-01T09:30 :47Z" 

availabilityEndTime="2010-04-07T09:30 :47Z" 

xsi :schemaLocation= "urn :mpeg: DASH: schema : MPD : 2011 3GPP-RellO-MPD .xsd" 

xmlns :xsi="http : //www.w3 . org/2 01/XMLSchema- instance" 

xmlns = "urn : mpeg : DASH : schema : MPD : 2 11 " > 

<ProgramInformation lorelnf ormationURL="http : //www. example . com" > 

<Title>Example</Title> 
< / Programinf ormat ion> 

<BaseURL>http : //www. example . com</BaseURL> 
<Period _.r.i _ = "PTOS"> 

<AdaptationSet mimeType= "video/3gpp" > 

< Content Component contentType= "video" /> 
<ContentComponent contentType="audio" lang="en"/> 

<Representation i?odecs="s2S3 , samr" bandwidth="256000" id="256"> 
<BaseURL> " repl " </BaseURL> 

<SegmentList duration="1000" timescale="100" > 
< Initialization sourceURL="seg-init . 3gp"/> 
<SegmentURL media="seg-l . 3gp"/> 
<SegmentURL media="seg-2 . 3gp"/> 
<SegmentURL :'dia="seg-3 . 3gp"/> 
</SegmentList> 
< /Represent at ion> 

<Representation :--dscs="mp4v. 20 . 9, mp4a.El" bandwidth="128000" id="128"> 
<BaseURL> " rep2 " < /BaseURL> 
<SegmentList 'cion="10"> 

< Initialization sourceURL="seg-init . 3gp"/> 
<SegmentURL media="seg-l . 3gp"/> 
<SegmentURL media="seg-2 . 3gp"/> 
<SegmentURL media="seg-3 . 3gp"/> 
</SegmentList> 
< /Represent at ion> 
</ Adapt at ionSet> 
</Period> 

<Period :ai-T:^"PT30S" > 
<SegmentTemplate 
duration="10" 

initialization="seg-init-$RepresentationId$ . 3gp" 
- ii -I- "http : //example . com/$RepresentationId$/$Number$ . 3gp" /> 
<AdaptationSet mimeType="video/3gpp" ■ ■ -v - -"mp4v. 20 . 9, mp4a.El"> 
< Content Component contentType= "video " / > 
<ContentComponent contentType="audio" lanq="en"/> 
<Representation jandwidth="25S000" id="l"/> 
<Representation aandwidth="128000" id="2"/> 
</ Adapt at ionSet> 
</Period> 
</MPD> 



D.2 Live Service 

Table D.2 provides an example MPD for a live service. 
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Table D.2: Example MPD for a Live Service 
Table D.2: Example IVIPD for a Live Service 



<?xml version="l . 0" ?> 
<MPD 

jrof iles= "urn : 3GPP : PSS : profile : DASHIO " 

type =" dynamic" 

minBuf ferTime="PT3S" 

availabilityStartTime="2 010-04-2ST0 8 :45: 00-08: 00" 

minimumUpdatePeriod= " PT5M0S " 

timeShiftBufferDepth="PTlH3 0M0S" 

xsi :schemaLocation= "urn :mpeg: DASH: schema : MPD : 2011 3GPP-RellO-MPD .xsd" 

xmlns :xsi="http : //www.w3 . org/2 01/XMLSchema- instance" 

xmlns = "urn : mpeg : DASH : schema : MPD : 2 11 " > 

<ProgramInformation ■mationURL="http : //www. example . com" > 

<Title>Example 3: 3GPP SA4 Meeting in Vancouver as Live Broadcast</Title> 
< Source >3GPP</ Source > 
< / Programinf ormat ion> 
< Period .;i^r = "PT0S" id="0"> 

<AdaptationSet mimeType= ' video/3gpp ' codecs="avcl . 42E00B" width="320" height="240" 
coritentType= "video" > 

<SegmentTemplate 
duration="60" 

initialization="http : //www. ad- server . com/l-day-black/QVGA/0 . 3gp" 
r -iio = "http : //www. ad- server . com/l-day-black/QVGA/$Number$ . 3gp" > 
</SegmentTemplate> 

<Representation "Ad-QVGA" bandwidth="10000" > 
< /Represent at ion> 
</ Adapt at ionSet> 
</Period> 

<Period . . _ . -"PT15M0S" :'•"!"> 
<SegmentTemplate 
duration="10" 

initialization="http: //www. example . com/Period-2010- 04 -26T08 -45- 00/rep- 
$RepresentationID$/seg-0 . 3gp" 

niedia="http: //www. example . com/Period-2010 -04 -26T08 -45- 00/rep- 
$RepresentationID$/seg-$Number$ . 3gp"/> 

<AdaptationSet rriimeType= ' video/3gpp ' > 

< Content Component contentType= "video" /> 
<ContentComponent contentType="audio" lang="en"/> 
<Representation 
id="QVGA-LQ" "avcl . 42E00C, mp4a.40.2" bandwidth= "192000 " .,idth="320" height = "240 " /> 

<Representation :i-"QVGA-HQ" codecs="avcl . 42E00C, mp4a.40.2" bandwidth="384000" 
width="320" "240"/> <Representation id="VGA-LQ" mimeType= ' video/3gpp ' 
codecs = "avcl.64001E, mp4a.40.2" -/idth="512000" width="640" height = "480"/> 

<Representation "VGA-HQ" codecs="avcl . 64001E, mp4a.40.2" bandwidth="1024000" 
width= " G4 " he iqr. - - " 4 8 " / > 
< /Adapt at ionSet> 
</Period> 

<Period . _a_'_ -"PT2H01M22 . 12S" i:i="2"> 
<SegmentTemplate duration="10" 

: j --"http : //www. ad- server . com/15min-Ads/$RepresentationID$/$Number$ . 3gp" 
nitialization="http : //www. ad- server . com/15min-Ads/$RepresentationID$/0 . 3gp"/> 
<AdaptationSet mimeType= ' video/3gpp ' > 

< Content Component contentType= "video " / > 
<ContentComponent contentType="audio" lariq="en"/> 

<Representation " "QVGA" codecs="avcl . 42E00C, mp4a.40.2" bandwidth="25S000" 
width="32 0" heiqht="240"/> 

<Representation "VGA" codecs="avcl . 64001E, mp4a.40.2" bandwidth="512000" 
wl dt L- "640" 1..- ia!:^ ^ " 4 8 " / > 
</ Adapt at ionSet> 
</Period> 

<Period rart ="PT2H16M22 . 12S" " "3"> 
<SegmentTemplate 
duration="10" 

media="http: //www. example . com/Period-2010-04-26Tll-01-22/rep- 
$RepresentationID$/seg-$Number$ . 3gp" 

i.nitializatiori="http: //www. example . com/Period-2010 -04 -2STll-01-22/rep- 
$RepresentationID$/seg-0 . 3gp" 
/> 
<AdaptationSet imeType= ' video/3gpp ' contentType= "video" > 

<Representation id="QVGA-LQ" codecs="avcl . 42E00C" bandwidth^ "192000 " width="320" 
height="240"/> 

<Representation "QVGA-HQ" codecs="avcl . 42E00C" bandwidth="384000" width="320" 
height="240"/> 
<Representation "VGA-LQ" :-dprq = "avcl . 64001E" bandwidth="512000" vid\h="640" 
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height="480"/> 

<Representation ."VGA-HQ" :;odecs = "avcl . 64001E" bandwidth="1024000" 


widths 


"640" 


height = "480'7> 

</AdaptationSet> 

<AdaptationSet .meType= ' audio/3gpp ' contentType="audio" lang="en"> 
<Representation id="audio" jodecs="mp4a. 40 . 2" bandwidth="32000"/> 






<Representation '-"audio" codecs="mp4a . 40 . 2" bandwidth="64000"/> 






</ Adapt at ionSet> 
</Period> 
</MPD> 







D.3 MPD Assembly 



Table D.3 provides an example MPD with reference to external Period element as provided in Table D.4. An equivalent 
MPD to the one in Table D.3 after dereferencing with the Period element in Table D.4 is shown in Table D.2. 

Table D.3: Example MPD with reference to external Period element 



<?xml version="l . 0" ?> 
<MPD 

a-of lies = "urn : 3GPP : PSS : profile : DASHIO " 
type=" dynamic" 
minBuf ferTime="PT3S" 

availabilityStartTime="2 010-04-2 6T0 8 :45: 00-08: 00" 
minimumUpdatePeriod= " PT5M0S " 
timeShiftBufferDepth="PTlH3 0M0S" 

xsi :schemaLocation= "urn :mpeg: DASH: schema : MPD : 2011 3GPP-RellO-MPD .xsd" 
xmlns :xsi="http : //www.w3 . org/2 01/XMLSchema- instance" 
xmlns :xlink="http : //www.w3 .org/1999/xlink" 
xmlns = "urn : mpeg : DASH : schema : MPD : 2011 " > 
<ProgramInformation moreinf ormationURL="http : //www. example . com" > 

<Title>Example 3: 3GPP SA4 Meeting in Vancouver as Live Broadcast</Title> 
< Source >3GPP</ Source > 
</ProgramInformation> 
<Period -?:;--"PTOS" i:;="0"> 

<AdaptationSet mimeType= ' video/3gpp ' codecs="avcl . 42E00B" width="320" height="240" 
contentType= "video" > 

<SegmentTemplate 
;uration="SO" 

initialization="http : //www. ad- server . com/l-day-black/QVGA/0 . 3gp" 
media="http: //www. ad- server . com/l-day-black/QVGA/$Number$ . 3gp" > 
</SegmentTemplate> 

<Representation - '-"Ad-QVGA" bandwidth="10000" > 
< /Represent at ion> 
< /Adapt at ionSet> 
</Period> 

<Period :^tart = "PT15M0S" _ .-"1"> 
<SegmentTemplate 
.uration="10" 

initialization="http: //www. example . com/Period-2010- 04 -26T08 -45- 00/rep- 
$RepresentationID$/seg-0 . 3gp" 

[riedia="http: //www. example . com/Period-2010-04-26T08-45-00/rep- 
$RepresentationID$/seg-$Number$ . 3gp"/> 

<AdaptationSet aiimeType= ' video/3gpp ' > 

< Content Component contentType= "video" /> 
<ContentComponent contentType="audio" lang="en"/> 

<Representation id="QVGA-LQ" codecs="avcl . 42E00C, mp4a.40.2" bandwidth="192000" 
width="320" !-. I "240"/> 

<Representation "QVGA-HQ" codecs="avcl . 42E00C, mp4a.40.2" : awidth="384000" 
width="320" i -iJi- -^"240"/> <Representation id="VGA-LQ" mimeType= ' video/3gpp ' 
codecs="avcl.S4001E, mp4a.40.2" bandwidth= "512000 " width="S40" height="480"/> 

<Representation "VGA-HQ" ;'odecs = "avcl . S4001E, mp4a.40.2" bandwidth="1024000" 
width- "S40" "480 "/> 
</ Adapt at ionSet> 
</Period> 

<Period :link:ref="http : //www. example . com/Period. xml" ' ' "2"/> 
<Period ;tart="PT2H16M22 . 12S" id="3"> 
<SegmentTemplate 
iuration="10" 

inedia="http: //www. example . com/Period-2010-04-2STll-01-22/rep- 
$RepresentationID$/seg-$Number$ . 3gp" 

initialization="http: //www. example . com/Period-2010 -04 -26Tll-01-22/rep- 
$RepresentationID$/seg-0 . 3gp" 
/> 
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<AdaptationSet LmeType= ' video/3gpp ' contentType= "video" > 

<Representation id="QVGA-LQ" codecs="avcl . 42E00C" bandwidth^" 192000" width="320" 
height="240"/> 

<Representation "QVGA-HQ" codecs="avcl . 42E00C" bandwidth="384000" width="320" 
height="240"/> 

<Representation "VGA-LQ" :odecs = "avcl . 64001E" bandwidth="512000" ,;idi;h="640" 
height="480"/> 

<Representation "VGA-HQ" codecs="avcl . 64001E" bandwidth="1024000" width="S40" 

height="480"/> 

< /Adapt at ionSet> 

<AdaptationSet lmeType= ' audio/3gpp ' contentType="audio" lang="en"> 
<Representation id="audio" . s="mp4a. 40 . 2" bandwidth="32000"/> 
<Representation ■ "audio" codecs="mp4a . 40 . 2" bandwidth= "64000 "/> 
</ Adapt at ionSet> 
</Period> 
</MPD> 



Table D.4: External Period 



<?xml version="l . 0"?> 
<Period start="PT15M0S" > 
<SegmentTemplate 
iuration="10" 

initialization="http: //www. example . com/Period-2010- 04 -26T08 -45- 00/rep- 
$RepresentationID$/seg-0 . 3gp" 

"http : //www. example . com/Period-2010-04-26T08-45-00/rep-$RepresentationID$/seg- 
$Number$ . 3gp"/> 

<AdaptationSet niimeType= ' video/3gpp ' > 

< Content Component cont ent Type = "video" /> 
<ContentComponent contentType="audio" lang="en"/> 

<Representation id="QVGA-LQ" codecs="avcl . 42E00C, mp4a.40.2" bandwidth="192000" 
width="32 0" n^iaS"^"240"/> 

<Representation id= "QVGA-HQ" codecs="avcl . 42E00C, mp4a.40.2" bandwidth= "384000" 
width= "320" height = " 24 " / > 

<Representation id="VGA-LQ" mimeType= ' video/3gpp ' codecs="avcl . 64001E, mp4a.40.2" 
bandwidth="512000" idth="640" height="480"/> 

<Representation id="VGA-HQ" codecs="avcl . 64001E, mp4a.40.2" bandwidth="1024000" 
width="640" heig! "480"/> 

< /Adapt at ionSet> 
</Period> 



D.4 MPD Deltas 

In the following MPD example, the content is 30 minutes in duration. There are 3 Periods, each of 10 minutes duration. 
Each Period has 3 Representations and each Representation is contained within one 3gp file. Each Representation has 
audio encoded with Low Complexity- AAC. One Representation of each Period (plrepl.3gp, p2repl.3gp, and 
p3repl.3gp) has video resolution 320x240 encoded with H.264 baseline profile level 1.1. Another Representation of 
each Period (plrep2.3gp, p2rep2.3gp, and p3rep2.3gp) has resolution 320x240 encoded with H.264 baseline profile 
level 1.3. Finally, a third representation in each period (plrep3.3gp, p2rep3.3gp, and p3rep3.3gp) has resolution 
480x240 encoded with H.264 baseline profile level 2.1. One Representation of each Period has bandwidth of 239 kbps, 
a second representation has bandwidth of 478 kbps, and a third representation has bandwidth of 892 kbps. 

Since each represention is contained in one file, the Initialization Segments and the Media Segments for a 
representation are accessed with byte ranges. Each SegmentUrl element in the MPD contains a medlarange 
attribute and the corresponding byte range for the Initialization Segment or Media Segment. For the example each 
Segment of all representations is 10 seconds in duration. 

Line numbers of the MPD in the example are shown for clarity, although these would not be present in the MPD. 

EX/*lMPLE 1 (add) 

The change of adding the "Url" element for the last Segment to the Representation of the third Period with 239K 
bandwidth can be described as 

517a 

<SegmentUrl mediarange=" 17339554-17642841" /> 
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The line number of the MPD where the delta is applied is 5 17. The following line is added: 

EXAMPLE 2 (replace) 

Consider the change of replacing the line containing the DeltaSupport element in the next MPD. 

20c 

<DeltaSupport sourceURL="delta2 .mpdd" availabilityDuration="120s"/> 

EXAMPLE 3(delete) 

If lines 8 through 10 of the original MPD are deleted and not present in the updated MPD, the delta to express this is: 

8,10d 

Below is what the MPD looks like after 30 minutes. In this case, the MPD is updated approximately every 10 seconds. 



l<?xml version="l . 0" ?> 
2<MPD 

prof iles= "urn : 3GPP : PSS : profile : DASHIO " 

3 type= "dynamic" 

4 availabilityStartTime="2010-07-01T05 : 00 : OOZ" 

5 availabilityEndTime="2010-07-08T05 : 00 : OOZ" 

6 mediaPresentationDuration="PT2H" 

7 minimumUpdatePeriod="PT10S" 

8 minBuf ferTime="PT10S" 

9 timeShiftBufferDepth="PT30M" 

10 baseUrl="http : //www. example . com/" 

11 xmlns :xsi="http : //www. w3 . org/2 01/XMLSchema- instance" 

12 xmlns : x3gpp= "urn : 3GPP : ns : DASH : MPD- ext : 2 11 " 

13 xsi : schemaLocation=" urn :mpeg: DASH: schema : MPD : 2 011 3GPP-2 011 .xsd" 

14 xmlns =" urn :mpeg: DASH: schema : MPD : 2 011" > 
15 

16 <ProgramInformation moreInformationURL="http : //www. example . com" > 

17 <Title>Example</Title> 

18 <Source>Example</Source> 

19 < Copyright >Example< /Copyright > 

20 </ProgramInformation> 

21 <x3gpp : DeltaSupport sourceURL="deltal .mpdd" availabilityDuration="120s"/> 

22 <Period start="PTOS" bitstreamSwitching="true" id="0"> 

23 <AdaptationSet mimeType="video/3gpp> 

24 <ContentComponent contentType="video"/> 

25 <ContentComponent contentType=" audio" lang="en"/> 

26 <Representation id="0" bandwidth="23 9000" width="320" height="240" codecs="avcl .42E00b, 
mp4a.4 .2"> 

<BaseURL>"plrepl . 3gp"</BaseURL> 

27 <SegmentList duration="10" > 

28 <Initialization range="0-9B5" /> 

29 <SegmentUrl mediarange="986-293761" /> 

30 <SegmentUrl mediarange="293762-592501" /> 



84 <SegmentUrl mediarange="17600065-17894640" /> 

85 </SegmentList> 

86 </Representation> 

87 <Representation id="l" bandwidth= "478000" width="320" height="240" codecs="avcl .42E00d, 
mp4a.4 0.2" > 

88 <BaseURL>"plrep2 . 3gp" </BaseURL> 

89 <SegmentList duration="10"> 

90 <Initialization range="0-9B5" /> 

91 <SegmentUrl mediarange="986-586538" /> 

92 <SegmentUrl mediarange="586539-1184019" /> 



149 <SegmentUrl mediarange="35199171-35788323" /> 

150 </SegmentList> 

151 </Representation> 

152 <Representation id="2" bandwidth=" 892000" width="480" height="240" 
codecs="avcl .42E015, mp4a.4 .2"> 

153 <BaseURL>"plrep3 . 3gp" </BaseURL> 
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154 <SegmentList duration="10"> 

155 <Initialization range=" 0-985" /> 

156 <SegmentUrl mediarange="986-1093691" /> 

157 <SegmentUrl mediarange="1093692-2208656" /> 



214 <SegmentUrl mediarange="65684646-66784068" /> 

215 </SegmentList> 

216 </Representation> 
216 </AdaptationSet> 
217</Period> 

218<Period start="PT10M0S" bitstreamSwitching="true" id="l"> 
<AdaptationSet mimeType="video/3gpp> 

< Content Component contentType= "video" /> 
<ContentComponent contentType="audio" lang="en"/> 

219 <Representation id="0" bandwidth="239000" width="320" height="240" codecs="avcl . 42E00b, 
mp4a.40 .2"> 

<BaseURL>"p2repO . 3gp" </BaseURL> 

220 <SegmentList duration="10"> 

221 <Initialization range="0-9B5" /> 

222 <SegmentUrl mediarange="986-296011" /> 

223 <SegmentUrl mediarange="296012-595787" /> 



280 <SegmentUrl mediarange="17647666-17946154" /> 

281 </SegmentList> 

282 </Representation> 

283 <Representation id="l" bandwidth="478000" width="320" height="240" codecs="avcl . 42E00d, 
mp4a.40 .2"> 

284 <SegmentList duration="10"> 

285 <BaseURL>"p2repl . 3gp" </BaseURL> 

286 <Initialization range="0-9B5" /> 

287 <SegmentUrl mediarange="986-591037" /> 

288 <SegmentUrl mediarange="591038-1190590" /> 



385 <SegmentUrl mediarange="35294377-35891354" /> 

386 </SegmentList> 

387 </Representation> 

388 <Representation id="2" bandwidth=" 8 92000" width="480" height="240" codecs="avcl . 42E015 , 
mp4a.40 .2"> 

3 89 <BaseURL>"p2rep2 . 3gp" </BaseURL> 

390 <SegmentList duration="PT10S"> 

391 <Initialization range="0-985" /> 

392 <SegmentUrl mediarange="986-1102088" /> 

393 <SegmentUrl mediarange="1102089-2220920" /> 



450 <SegmentUrl mediarange="65862331-66976355" /> 

451 </SegmentList> 

452 </Representation> 

453 </AdaptationSet> 
454</Period> 

455<Period start="PT20M0S" bitstreamSwitching=" true" > 

456 <AdaptationSet mimeType="video/3gpp> 

457 <ContentComponent contentType="video"/> 

458 <ContentComponent contentType=" audio" lang="en"/> 

459 <Representation id="0" bandwidth="23 9000" width="320" height="240" 
codecs="avcl .42E00b, mp4a.4 .2"> 

460 <BaseURL>"p3repO . 3gp" </BaseURL> 

461 <SegmentList duration="10"> 

462 <Initialization range="0-9B5" /> 

463 <SegmentUrl mediarange="986-302469" /> 

464 <SegmentUrl mediarange="302470-597839" /> 



517 <SegmentUrl mediarange="17040002-17339553" /> 

518 </SegmentList> 

519 </Representation> 

520 <Representation id="l" bandwidth= "478000" width="320" height="240" codecs="avcl .42E00d, 
mp4a.4 0.2" > 
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521 


<BaseURL>"p3repl . 3gp" </BaseURL> 




522 


<SegmentList duration="10"> 




523 


<Initialization range="0-985" /> 




524 


<SegmentUrl 


mediarange="986-603953" /> 


525 


<SegmentUrl 


mediarange="603954-1194693" /> 


582 


<SegmentUrl 


mediarange="34 7 904 6-34 6 7814 9" /> 


583 


</SegmentList> 




584 


< /Represent at ion> 




585 


<Representation id="2" bandwidth=" 892000" width="480" height="240" codecs="avcl . 42E015, 


mp4a 


.40.2" > 




586 


<BaseURL>"p3rep2 . 3gp" </BaseURL> 




587 


<SegmentList duration="10"> 




588 


<Initialization range="0-985" /> 




589 


<SegmentUrl 


mediarange="986-1126190" /> 


590 


<SegmentUrl 


mediarange="112 6191-222 8575" /> 


647 


<SegmentUrl 


mediarange="63 5943 83-64 7123 74" /> 


648 


</SegmentList> 




649 


< /Represent at ion> 




650 


< /Adapt at ionSet> 




651 


</Period> 




652< 


/MPD> 





Since the value of OsourceURL in the above MPD is 'deltal . mpdd', deltal . mpdd is an empty file at the time 
of publication of the above MPD. 

The following file is deltal . mpdd after the next MPD update. Notice that clients have access to the new value of 
@sourceURL referenced by the latest MPD via the delta. 



647a 




<SegmentUrl mediarange=" 64712375-65844316 "/> 




582a 




<SegmentUrl mediarange=" 34678150-35284727" /> 




517a 




<SegmentUrl mediarange=" 17339554- 17642841 "/> 




21c 










<Delt 


aSupport sourceURL="delta2 .mpdd" availabilityDuration= 


"120s"/> 



At the next MPD update, 'deltal . mpdd' would contain the cumulative update for 2 MPD updates. 



647a 



582a 



21c 



<SegmentUrl mediarange=" 64712375-65844316 "/> 
<SegmentUrl mediarange=" 65844317-66966044 "/> 



<SegmentUrl mediarange=" 34678150-35284727 "/> 
<SegmentUrl mediarange=" 35284728 -35885833 "/> 



<SegmentUrl mediarange=" 17339554- 17642841 "/> 
<SegmentUrl mediarange=" 17642 842-179433 94 "/> 



<De It aSupport sourceURL="delta3 .mpdd" availabilityDuration="12 0s"/> 
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Annex E (normative): 
Void 
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Annex F (normative): 

OMA DM QoE Management Object 



As an alternative to configuring the QoE reporting for each session via MPD, OMA-DM can be used to specify the QoE 
configuration. If such an OMA-DM QoE configuration has been specified, it shall be evaluated by the client for all 
subsequent sessions. 

For the OMA-DM QoE configuration the parameters are specified according to the following Managed Object (MO), 
and represents the same information as specified in section 10.4 and 10.5. Version numbering is included for possible 
extension of the MO. 

The Management Object Identifier shall be: urn:oma :mo : ext-3gpp-pss-dash-qoe : 1 . 0. 

Protocol compatibility: The MO is compatible with OMA Device Management protocol specifications, version 1 .2 and 
upwards, and is defined using the OMA DM Device Description Framework as described in the Enabler Release 
Definition OMA-ERELD _DM-V1_2 [22]. 

The nodes and leaf objects as provided in Figure F. 1 shall be contained under the 3GPP_PSS_DASH_QOE node if a 
client supports the feature described in this clause. 



<X*> 



Enabled 



Servers 



APN 



Format 



Interval 



SamplePercentage 



StartTime 



Duration 



Metrics 



Ext 



Figure F.I : Nodes and leaf objects 
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Node: /<X> 

This interior node specifies the unique object id of a QoE metrics management object. The purpose of this interior node 
is to group together the parameters of a single object. 

Occurrence: ZeroOrOne 

Format: node 

Minimum Access Types: Get 

The following interior nodes shall be contained if the client supports the QoE Management Object. 

/<X>/Enabled 

This leaf indicates if QoE reporting is requested by the provider. 
Occurrence: One 
Format: bool 
Minimum Access Types: Get 

/<X>/Servers 

This leaf contains a space-separated list of servers to which the QoE reports are transmitted. It is URI addresses, e.g. 
http://qoeserver.operator.com. In case of multiple servers, the client randomly selects one of the servers from the list, 
with uniform distribution. 

Occurrence: One 

Format: chr 

Minimum Access Types: Get 

Values: URI of the servers to receive the QoE report. 

/<X>/APN 

This leaf contains the Access Point Name that should be used for establishing the PDP context on which the QoE metric 
reports will be transmitted. This may be used to ensure that no costs are charged for QoE metrics reporting. If this leaf 
is not defined then any QoE reporting is done over the default access point. 

Occurrence: ZeroOrOne 

Format: chr 

Minimum Access Types: Get 

Values: The Access Point Name 

/<X>/Format 

This leaf specifies the format of the report. If this leaf is not defined the QoE reports shall be sent uncompressed. 
Occurrence: ZeroOrOne 
Format: chr 

Minimum Access Types: Get 
Values: 'uncompressed', 'gzip' 
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/<X>/Interval 

This leaf specifies how often QoE reports shall be sent. If this leaf is not defined only one QoE report shall be sent after 
the complete session. 

Occurrence: ZeroOrOne 

Format: int 

Minimum Access Types: Get 

Values: seconds 

/<X>/SamplePercentage 

This leaf specifies the percentage of sessions for which QoE metrics shall be reported. The client evaluates a random 
number at start of each session to determine if reporting shall be done for the specific session. If this leaf is not defined 
QoE reports are sent for every session. 

Occurrence: ZeroOrOne 

Format: float 

Minimum Access Types: Get 

- Values: 0.0-100.0. 

/<X>/StartTime 

This leaf specifies when collection of QoE metrics shall start. It is specified in seconds and is relative to the start of the 
session. If this leaf is not defined, the QoE collection shall be done from the start of the session. 

Occurrence: ZeroOrOne 

Format: int 

Minimum Access Types: Get 

Values: seconds 

/<X>/Duration 

This leaf specifies for how long QoE collection shall be done. It is specified in seconds and is relative to the start time 
of QoE collection. If this leaf is not defined QoE collection shall be done until the end of the session. 

Occurrence: ZeroOrOne 

Format: int 

Minimum Access Types: Get 

Values: seconds. 

/<X>/Metrics 

This leaf specifies a list of white-space separated metrics which shall be reported, and follows the same syntax as 
specified for the "©metrics" attribute in Table 32. If this leaf is not defined no QoE reporting shall be done. 

Occurrence: ZeroOrOne 

Format: chr 

Minimum Access Types: Get 

Values: Metrics as specified in section 10.4. 
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/<X>/Ext 

The Ext node is an interior node where the vendor specific information can be placed (vendor includes application 
vendor, device vendor etc.). Usually the vendor extension is identified by vendor specific name under the ext node. The 
tree structure under the vendor identified is not defined and can therefore include one or more un-standardized sub- 
trees. 

Occurrence: ZeroOrOne 

Format: node 

Minimum Access Types: Get 
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Annex G (normative): 

File format extensions for 3GPP DASH support 

G.1 Introduction 

This clause documents extensions to the ISO base media file format [11] for the support of 3GPP DASH. It is expected 
that these boxes will be integrated in an updated version of ISO/IEC 14496-12 [11]. 



G.2 Level Assignment Box 
G.2.1 Definition 

Box Type: "leva" 

Container: Movie Extends Box ("mvex") 

Mandatory: No 

Quantity: Zero or one 

Levels specify subsets of the file. Samples mapped to level n may depend on any samples of levels m, where m <= n, 
and shall not depend on any samples of levels p, where p > n. 

Levels cannot be specified for the initial movie. When the Level Assignment box is present, it applies to all movie 
fragments subsequent to the initial movie. 

For the context of the Level Assignment box, a fraction is defined to consist of one or more Movie Fragment boxes and 
the associated Media Data boxes, possibly including only an initial part of the last Media Data Box. Within a fraction, 
data for each level shall appear contiguously. Data for levels within a fraction shall appear in increasing order of level 
value. All data in a fraction shall be assigned to levels. 

NOTE: In the context of 3G DASH, each subsegment indexed within a Subsegment Index box is a fraction. 

The Level Assignment box provides a mapping from features, such as temporal sub-sequences, to levels. A feature can 
be specified through a track or a sample grouping of a track. 

The following assignment_types are defined; assignment_type values greater than 4 are reserved, while the 
semantics for the other values are specified as follows. 

0: sample groups are used to specify levels, i.e. i.e. samples mapped to different sample group description 
indexes of a particular sample grouping lie in different levels within the identified track; other tracks are not 
affected and must have all their data in precisely one level; 

1 : as for assignment_type except assignment is by a parameterized sample group; 

2, 3: level assignment is by track (see the Subsegment Index Box for the difference in processing of these levels) 

The sequence of assignment_types is restricted to be a set of zero or more of type 2 or 3, followed by zero or more of 
exactly one type. 



G.2.2 Syntax 



aligned{8) class LevelAssignmentBox extends FullBox { "leva" , 0, 0) { unsigned int{8) level_count; 
for {j=l; j <= level_count ; j++) { 
unsigned int(32) trackid; 
unsigned int{l) padding_f lag; 
unsigned int{7) assignment_type; 
if {assignment_type == 0) 

unsigned int{32) grouping_type; 
else if {assignment_type == 1) { 
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unsigned int{32) grouping_type; 

unsigned int{32) grouping_type_parameter; 

} 

else if {assignment_type ==2) {} // no further syntax elements needed 

else if {assignment_type ==3) {} // no further syntax elements needed } 



G.2.3 Semantics 

level_count specifies the number of levels each fraction is grouped into. level_count shall be greater than 
or equal to 2. 

track_id for loop entry j specifies the track identifier of the track assigned to level j. 

padding_f lag equal to 1 indicates that a conforming fraction can be formed by concatenating any positive 

integer number of levels within a fraction and padding the last Media Data box by zero bytes up to the full size 
that is indicated in the header of the last Media Data box. The semantics of padding_f lag equal to are 
unspecified. 

assignment_type indicates the mechanism used to specify the assignment to a level. assignment_type 
values greater than 3 are reserved, while the semantics for the other values are specified as follows. 

grouping_type and grouping_type_parameter, if present, specify the sample grouping used to map 
sample group description entries in the Sample Group Description box to levels. Level n contains the samples 
that are mapped to the sample group description entry having index n in the Sample Group Description box 
having the same values of grouping_type and grouping_type_parameter, if present, as those 
provided in this box. 



G.3 Subsegment Index Box 
G.3.1 Definition 

Box Type: "ssix" 
Container: File 
Mandatory: No 
Quantity: Zero or more 

The Subsegment Index box ('ssix') provides a mapping from levels (as specified by the Level Assignment box) to byte 
ranges of the indexed subsegment. In other words, this box provides a compact index for how the data in is ordered 
according to levels into partial sub-segments. It enables a client to easily access data for partial subsegments by 
downloading ranges of data in the subsegment. 

Each byte in the subsegment shall be assigned to a level. If the range is not associated with any information in the level 
assignment, then any level that is not included in the level assignment may be used. Each level shall be assigned to 
exactly one partial sub-segment, i.e. byte ranges for one level shall be contiguous. 

Samples of a partial subsegment may depend on any samples of preceding partial subsegments in the same subsegment, 
but not the other way around. For example, each partial subsegment contains samples having an identical temporal level 
and partial subsegments appear in increasing temporal level order within the subsegment. 

There may be or 1 Subsegment Index boxes per each Segment Index box that does not refer to other Segment Index 
boxes, i.e. that only indexes subsegments but no segment indexes. A Subsegment Index box, if any, shall be the next 
box after the associated Segment Index box. A Subsegment Index box documents the subsegment that is indicated in the 
immediately preceding Segment Index box. 

When a partial segment is accessed in this way, for all assignTnent_types other than 3, the final Media Data box 
may be incomplete, that is, less data is accessed than the length indication of the Media Data Box indicates is present. 
The length of the Media Data box may need adjusting, or padding used. The padding_flag in the Level Assignment Box 
indicates whether this missing data can be replaced by zeros. If not, the sample data for samples assigned to levels that 
are not accessed is not present, and care should be taken not to attempt to process such samples. 
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NOTE: assignment_type equal to 3 may be used, for example, when audio and video movie fragments 

(including the respective Media Data boxes) are interleaved. The first level can be specified to contain the 
audio movie fragments (including the respective Media Data boxes), whereas the second level can be 
specified to contain both audio and video movie fragments (including all Media Data boxes). 



G.3.2 Syntax 



aligned (8) class SubsegmentlndexBox extends FullBox { "ssix" , 0, 0) 
unsigned int{32) subsegment_count ; 



f or ( i=l; i <= subsegment_count ; i++) 
unsigned int{8) ranges__count ; 

for { j=l; j <= range s_count; j++) { 
unsigned int{8) level, - 
unsigned int{24) accumulated_level_size; 



G.3.3 Semantics 

subsegment_count is a positive integer specifying the number of subsegments for winicin partial 
subsegment information is specified in tinis box. subsegment_count sinall be equal 
ref erence_count (i.e. the number of movie fragment references) in the immediately preceding 
Segment Index box. 

ranges_count specifies the number of partial subsegment levels the media data is grouped into. This 
value shall be greater than or equal to 2. 

range_size indicates the size of the partial subsegment. 

level specifies the level to which this partial subsegment is assigned to. 



G.4 Temporal level sample grouping 
G.4.1 Definition 

Many video codecs support temporal scalability where it is possible to extract one or more subsets of frames that can be 
independently decoded. A simple case is the extraction of I frames for a bitstream with a regular I-frame interval, e.g. 
IPPPIPPP. . ., where every 4th picture is an I frame. Also subsets of these I frames can be extracted for even lower frame 
rates. More elaborate situations with several temporal levels can be constructed using hierarchical B or P frames. 

The Temporal Level sample grouping ('tele') provides a codec-independent sample grouping that can be used to group 
samples (access units) in a track (and potential track fragments) according to temporal level, where samples of one 
temporal level have no coding dependencies on samples of higher temporal levels. The temporal level equals the sample 
group description index (taking values 1, 2, 3, etc). The bitstream containing only the access units of from the first 
temporal level to a higher temporal level remains conforming to the coding standard. 

A grouping according to temporal level facilitates easy extraction of temporal subsequences, for instance using the 
Subsegment Index box in clause G.3. 



G.4.2 Syntax 



class TemporalLevelEntry { ) extends SampleGroupDescriptionEntry { ' tele ' ) 

{ 

bit (1) level_independently_decodable; 
bit(7) reserved=0; 



G.4.3 Semantics 

The temporal level of samples in a sample group equals to the sample group description index. 
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level_independently_decodable is a flag. 1 indicates tinat all samples of this level have no coding 
dependencies on samples of other levels. indicates that no information is provided. 



G.5 Producer reference box 
G.5.1 Definition 

Box Type: "prft" 
Container: File 
Mandatory: No 
Quantity: Zero or more 

The producer reference time box supplies relative wall-clock times at which movie fragments, or files containing movie 
fragments (such as segments) were produced. When these files are both produced and consumed in real time, this can 
provide clients with information to enable them to synchronize consumption with the production and thus avoid buffer 
overflow or underflow. 

This box is related to the next movie fragment box that follows it in bitstream order. It must follow any segment type or 
segment index box (if any) in the segment, and occur before the following movie fragment box (to which it refers). If a 
segment file contains any producer reference time boxes, then the first of them shall occur before the first movie 
fragment box in that segment. 

The box contains a time value measured on a clock which increments at the same rate as a UTC-synchronized NTP 
clock, using NTP format. This is associated with a media time for one of the tracks in the movie fragment. That media 
time should be in the range of times in that track in the associated movie fragment. 



G.5.2 Syntax 



aligned{8) class ProducerRef erenceTimeBox extends FullBox { "srf t" , version, 0) { 
unsigned int{32) ref erence_track_ID; 
unsigned int{S4) ntp_timestamp; 
if (version==0) 

{ 

unsigned int{32) media_time; 
} else 

{ 

unsigned int{64) media_time; 

} 
} 

G.5.3 Semantics 

ref erence_track_ID provides the track_ID for the reference track. 

ntp_timestaTnp indicates a UTC time in NTP format corresponding to decoding_time. 

media_time corresponds to the same time as ntp_timestamp, but in the time units used for the reference 
track, and is measured on this media clock as the media is produced. Note that in most cases this timestamp 
will not be equal to the timestamp of the first sample of the adjacent segment of the reference track, but it is 
recommended it be in the range of the segment containing this producer reference time box. 



G.6 Stream Access Points 
G.6.1 Introduction 

This Annex defines a Stream Access Point (SAP) and specifies six types of SAPs. 

A Stream Access Point (SAP) enables random access into a container of media stream(s). A container may contain 
more than one media stream, each being an encoded version of continuous media of certain media type. A SAP is a 
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position in a container enabling playback of an identified media stream to be started using only (a) the information 
contained in the container starting from that position onwards, and (b) possible initialization data from other part(s) of 
the container, or externally available. Derived specifications should specify if initialization data is needed to access the 
container at a SAP, and how the initialization data can be accessed. 

G.6.2 SAP properties 

For each SAP the properties, Isap, Tsap, Isau^ Tqec^ Tept, and Tpxp are identified and defined as: 

• Tsap is the earliest presentation time of any access unit of the media stream such that all access units of the 

media stream with presentation time greater than or equal to Tsap can be correctly decoded using data in 
the Bitstream starting at Isap and no data before Isap- 

• Isap is the greatest position in the Bitstream such that all access units of the media stream with presentation 

time greater than or equal to Tsap can be correctly decoded using Bitstream data starting at Isap and no data 
before Isap- 

• IsAu is the starting position in the Bitstream of the latest access unit in decoding order within the media 

stream such that all access units of the media stream with presentation time greater than or equal to Tsap 
can be correctly decoded using this latest access unit and access units following in decoding order and no 
access units earlier in decoding order. 

NOTE IsAu is always greater than or equal to Isap- 

• Tdec is the earliest presentation time of any access unit of the media stream that can be correctly decoded 

using data in the Bitstream starting at Isau and no data before Isau- 

• Tept is the earliest presentation time of any access unit of the media stream starting at Isau in the Bitstream. 

• TpxE is the presentation time of the first access unit of the media stream in decoding order in the Bitstream 

starting at Isau- 

G.6.3 SAP types 

Six types of S APs are defined with properties as follows: 

• Type 1 : Tept = Tdec = Tsap = TpxF 

• Type 2: Tept = Tdec = Tsap < Tpxp 

• Type 3: Tept < Tdec = Tsap <= Tpxp 

• Type 4: Tept <= TpTP < Tdec = Tsap 

• Type 5: Tept = Tdec < Tsap 

• Type 6: Tept < Tdec < Tsap 

NOTE The type of SAP is dependent only on which Access Units are correctly decodable and their arrangement 
in presentation order. The types informally correspond with some common terms: 

• Type 1 corresponds to what is known in some coding schemes as a 'Closed GoP random access point' (in 

which all access units, in decoding order, starting from Isap can be correctly decoded, resulting in a 
continuous time sequence of correctly decoded access units with no gaps) and in addition the access unit in 
decoding order is also the first access unit in presentation order. 

• Type 2 corresponds to what is know in some coding schemes as a 'Closed GoP random access point', for 

which the first access unit in decoding order in the media stream starting from Isau is not the first access 
unit in presentation order. 
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• Type 3 corresponds to what is known in some coding schemes as an 'Open GoP random access point', in 

which there are some access units in decoding order following Isau that cannot be correctly decoded and 
have presentation times less than Tsap- 

• Type 4 corresponds to what is known in some coding schemes as an "Gradual Decoding Refresh (GDR) 

random access point', in which there are some access units in decoding order starting from and following 
Isau that cannot be correctly decoded and have presentation times less than Tsap- 

• Type 5 corresponds to the case for which there is at least one access unit in decoding order starting from Isap 

that cannot be correctly decoded and has presentation time greater than Tdec and where Tdec is the earliest 
presentation time of any access unit starting from Isau- 

• Type 6 corresponds to the case for which there is at least one access unit in decoding order starting from Isap 

that cannot be correctly decoded and has presentation time greater than Tqec and where Tqec is not the 
earliest presentation time of any access unit starting from Isau- 
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Annex H (normative): 

MIME Type Registration for MPD 

H.1 MPD MIME Type 
H.1.1 Introduction 

This Annex provides the formal MIME type registration for the MPD. It is referenced from the registry at 
http://www.iana. org/ . 

H.1 .2 MIME Type and Subtype 

The MIME Type and Subtype are defined as follows: 

Media Type Name: application 

Subtype name: dash+xml 

Required parameters: none 

Optional parameters: The "profiles" parameter as documented in this published specification 

Encoding considerations: the same media type encoding considerations specified in section 3.2 of RFC 3023 [30] 

Security considerations: The MPD is a media presentation description and contains references to other resources. 
It is coded in XML, and there are risks that deliberately malformed XML could cause security issues. In 
addition, an MPD could be authored that causes receiving clients to access other resources; if widely distributed, 
this could be used to cause a denial-of-service attack. 

Interoperability considerations: 

The specification defines a platform-independent expression of a presentation, and it is intended that wide 
interoperability can be achieved. 

Published specification: 3GPP TS 26.247 

Applications which use this media type: various 

Additional information: 

File extension(s): mpd 

Intended usage: common 

Other Information/General Comment: none 
Person to contact for further information: 

Name: Thomas Stockhammer 

Email: stockhammer@nomor.de 
Author/Change controller: 3GPP 
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H.1.3 Parameters 

H.1 .3.1 The profiles parameter 

Parameter name: profiles 

Parameter value: The 'profiles' parameter is an optional parameter that indicates one or more profiles to which 

the file claims conformance. The contents of this attribute shall conform to either the pro- 
simp or pro- fancy productions of RFC6381 [26], Section 4.5. The profile identifiers 
reported in the MIME type parameter should match identically the profiles reported in the 
profiles attribute in the MPD itself (see clause 7.3). 

example: 

application/dash+xml ;prof iles= ' urn : 3GPP : PSS : profile : XY, urn : 3GPP : PSS : profile : X 
Z' 

H.2 Delta MPD MIME Type 
H.2.1 Introduction 

This Annex provides the formal MIME type registration for the delta MPD. It is referenced from the registry at 
http://www.iana. org/ . 

H.2.2 MIME Type and Subtype 

The MIME Type and Subtype are defined as follows: 

Media Type Name: application 

Subtype name: deltadash+xml 

Required parameters: none 

Optional parameters: none 

Encoding considerations: the same media type encoding considerations specified in section 3.2 of RFC 3023 [30] 

Security considerations: The Delta MPD is a media presentation description and contains references to other 

resources. It is coded in XML, and there are risks that deliberately malformed XML could cause security issues. 
In addition, an Delta MPD could be authored that causes receiving clients to access other resources; if widely 
distributed, this could be used to cause a denial-of-service attack. 

Interoperability considerations: 

The specification defines a platform-independent expression of a presentation, and it is intended that wide 
interoperability can be achieved. 

Published specification: 3GPP TS 26.247 

Applications which use this media type: various 

Additional information: 

File extension(s): mpdd 

Intended usage: common 

Other Information/General Comment: none 
Person to contact for further information: 
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Name: Thomas Stockhammer 
Email: stockhammer@nomor.de 
Author/Change controller: 3GPP 
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Annex I (informative): 

Signalling of DASH AVP values for QoS handling in the 

PCC 

The PCC architecture is defined in TS 23.203 [31] and provides the Rx reference point, which enables the 
application layer to authorize a specific usage. In this architecture the DASH HTTP streaming server or any 
other function in the HTTP streaming path (e.g. an HTTP proxy) can act as Application Function and interact 
with the PCRF via the Rx reference point for QoS control. It is assumed here that the AF has knowledge of the 
application type and of the MPD. 

The relevant AVPs are the ones enabling the PCRF to establish bearers with correct characteristics for DASH users. 
The AVPs are defined in TS 29.214 [33]. The further PCRF mapping from AVP to IP QoS parameter mapping is 
defined in TS 29.213 [32] 
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Table 1.1 : Example mapping of MPD parameters to Rx AVPs for 3GP-DASH (PSS) 



AVP 


Value 


Comment 


AF-Application-ldentifier 


'DASH' 


Allows to signal the DASH based application 
hence giving the opportunity to enforce 
application specific policies 


Max-Requested-Bandwidth-DL 
(N0TE1) 


B1 


Bl = sum of all MPDOmaxBandwidth 
(see clause 8.4.3.3) of all media components 
simultaneously (not mutually exclusiye) 
selectable by the DASH client plus 
HTTP/TCP/IP overhead and TCP messages 
for flow control. 

If this attribute is not present then 
Bl = sum of MPDObandwidth attributes 
of all media components of the available 
media presentation corresponding to 
representations or subrepresentations with 
highest bandwidth simultaneously selectable 
(not mutually exclusive) by the DASH client 
plus HTTP/TCP/IP overhead and TCP 
messages for flow control. 

Note: the mapping rules to derive the TCP 
message flow control bandwidth are FFS. 


Max-Requested-Bandwidth-UL 
(N0TE1) 


FFS 


For Further Study. If included, should be 
greater than or equal to IVIin-Requested- 
Bandwidth-UL 


Min-Requested-Bandwidth-DL 
(N0TE1) 


B2 


B2 = sum of all MPDOminBandwidth 
(see clause 8.4.3.3) of all media components 
simultaneously (not mutually exclusive) 
selectable by the DASH client plus 
H 1 1 P/TCP/IP overhead and TCP messages 
for flow control. 

If this attribute is not present then 
B2 = sum of MPDObandwidth attributes 
of all media components of the available 
media presentation corresponding to 
representations or subrepresentations with 
lowest bandwidth simultaneously (not 
mutually exclusive) selectable by the DASH 
client plus HTTP/TCP/IP overhead and TCP 
messages for flow control. 

Note: the mapping rules to derive the TCP 
message flow control bandwidth are FFS. 


Min-Requested-Bandwidth-UL 
(N0TE1) 


FFS 


For Further Study. Enough bitrate to cover 
TCP and HTTP GET requests. 


Flow-Description AVP 
(N0TE1) 


IP addresses and ports 





NOTE 1 : AVPs provided within the Media-Component-Description AVP, except Flow-Description AVP 
that is included within the Media-Sub-Component AVP. Omitted AVPs are not relevant for this 
functionality 
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Annex J (informative): 
Change history 



Change history 


Date 


TSG# 


TSG Doc. 


CR 


Rev 


Subject/Comment 


Old 


New 


2011-06 


52 


SP-1 10305 






Version 10.0.0 approved at TSG SA#52 




10.0.0 


2011-11 


54 


SP-1 10794 


0003 




Alignment with IVIPEG DASH 


10.0.0 


10.1.0 


2011-11 


54 


SP-1 10794 


0005 


3 


QoE Updates for Correction, Clarification and MPEG 
DASH Alignment 


10.0.0 


10.1.0 


2011-11 


54 


SP-1 10794 


0006 


2 


QoS Support for 3GP-DASH Services 


10.0.0 


10.1.0 


2012-06 


56 


SP-1 20221 


0007 


3 


Alignment with MPEG DASH 


10.1.0 


10.2.0 


2012-06 


56 


SP-1 20221 


0010 


1 


Correction of Table Reference for Change 
Commands in MPD Deltas 


10.1.0 


10.2.0 


2012-06 


56 


SP-1 20221 


0012 


2 


ContentProtection element update to signal version of 
DRM system 


10.1.0 


10.2.0 


2012-09 


57 


SP-1 20504 


0009 


3 


QoE Reporting for DASH over Combined MBMS 
Download and HTTP-based Delivery 


10.2.0 


11.0.0 


2012-09 


57 


SP-1 20509 


0014 


6 


Inclusion of MVC support for DASH 


10.2.0 


11.0.0 


2012-09 


57 


SP-1 20509 


0015 


4 


Inclusion of 3D Video Format Information in DASH 
MPD 


10.2.0 


11.0.0 


2012-12 


58 


SP-1 20761 


0016 


7 


Supporting HTTP Partial Response 


11.0.0 


11.1.0 
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History 



Document history 
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Publication 
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